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Abstract 

<N : 

J> ■ We present next-to-leading order QCD predictions for a variety of distributions in W + 3-jet 

^t- : 

production at both the Tevatron and the Large Hadron Collider. We include all subprocesses and 

G\ ■ 

incorporate the decay of the W boson into leptons. Our results are in excellent agreement with 
existing Tevatron data and provide the first quantitatively precise next-to-leading order predictions 

o\ : 

for the LHC. We include all terms in an expansion in the number of colors, confirming that 



X 



the specific leading-color approximation used in our previous study is accurate to within three 



percent. The dependence of the cross section on renormalization and factorization scales is reduced 
significantly with respect to a leading-order calculation. We study different dynamical scale choices, 
and find that the total transverse energy is significantly better than choices used in previous 
phenomenological studies. We compute the one-loop matrix elements using on-shell methods, as 
numerically implemented in the BlackHat code. The remaining parts of the calculation, including 
generation of the real-emission contributions and integration over phase space, are handled by the 
SHERPA package. 

PACS numbers: 12.38.-t, 12.38.Bx, 13.87.-a, 14.70.-e, 14.70.Fm, 11.15.-q, ll.15.Bt, 11.55.-m 
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I. INTRODUCTION 



The upcoming start of physics runs at the Large Hadron Collider (LHC) has added im- 
petus to the long-standing quest to improve theoretical control over Standard-Model back- 
grounds to new physics searches at hadron colliders. Some backgrounds can be understood 
without much theoretical input. For example, a light Higgs boson decaying into two photons 
produces a narrow bump in the di-photon invariant mass, from which the large but smooth 
QCD background can be subtracted experimentally using sideband information. However, 
for many searches the signals are excesses in broader distributions of jets, along with missing 
energy and charged leptons or photons; such searches require a much more detailed theo- 
retical understanding of the QCD backgrounds. A classic example is the production of a 
Higgs boson in association with a W boson at the Tevatron, with the Higgs decaying to a bb 
pair, and the W decaying to a charged lepton and a neutrino. The peak in the bb invariant 
mass is much broader than in the di-photon one; therefore variations in the backgrounds, 
including QCD production of Wbb, across the signal region are more difficult to assess. 

In this paper, we focus on a related important class of backgrounds, production of multiple 
(untagged) jets in association with a W boson. Such events, with a leptonically decaying W, 
form a background to supersymmetry searches at the LHC that require a lepton, missing 
transverse energy and jets jj. If the lepton is missed, they also contribute to the background 
for similar searches not requiring a lepton. The rate of events containing a W along with 
multiple jets can be used to calibrate the corresponding rate for Z production with multiple 
jets, which form another important source of missing transverse energy when the Z decays 
to a pair of neutrinos. Analysis of W plus multi-jet production will also assist in separating 
these events from the production of top-quark pairs, so that more detailed studies of the 
latter can be performed. 

The first step toward a theoretical understanding of QCD backgrounds is the evalua- 
tion of the cross section at leading order (LO) in the strong coupling as- Our particular 
focus is on high let multiplicity in association with vector boson production. Many com- 
puter codes [2|, y, IJ] are available to generate predictions at leading order. Some of the 
codes incorporate higher-multiplicity leading-order matrix elements into parton showering 
programs [fa, |6|], using matching (or merging) procedures 7|, |8j. LO predictions suffer from 
large renormalization- and factorization-scale dependence, growing with increasing jet mul- 
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tiplicity, and already up to a factor of two in the processes we shall study. Next-to-leading 
order (NLO) corrections are necessary to obtain quantitatively reliable predictions. They 
typically display a greatly reduced scale dependence Fixed-order results at NLO can 
also be matched to parton showers. This has been done for a growing list of processes within 
the MC@NLO program and the POWHEG method [h]]. It would be desirable to extend 
this matching to higher-multiplicity processes such as those we discuss in the present paper. 

The production of massive vector bosons in association with jets at hadron colliders has 
been the subject of theoretical studies for over three decades. Early studies were of large 
transverse- momentum muon-pair production at leading order llj, followed by the leading- 

^ ^ fOT w + — Q , d ^ 

studies [13|, HJ]. The early leading-order studies were followed by NLO predictions for vector 



boson production in association with a single jet [15|, 1161 ] . Leading-order results for vector- 
boson production accompanied by three or four jets appeared soon thereafte r |17t| . These 
theoretical studies played an important role in the discovery of the top quark [18] . Modern 
matrix element generators allow for even larger numbers of final-state jets at LO. 

The one-loop matrix elements for W + 2-jet a nd Z + 2-jet production were determined [3] 



via the unitarity method [20] (see also ref. |2l|), and incorporated into the parton- level 
MCFM [22] code. 

Studies of W production in association with heavy quarks have also been performed. 
NLO results for hadronic production of a W and a charm quark first appeared in ref. j^]. 
More recently, NLO results have been presented for W + 6 + jet production 24], as well 
as for Wbb production with full b quark mass effects [25j. The last two computations were 
combined to produce a full description of W production in association with a single 6-jet in 



ref. 



26]. 



NLO studies of W production in association with more jets have long been desirable. 
However, a bottleneck to these studies was posed by one-loop amplitudes involving six or 
more partons [9(. On-shell methods [27J, which exploit unitarity and recursion relations, 
have successfully broken this bottleneck, by avoiding gauge-noninvariant intermediate steps, 
and reducing the problem to much smaller elements analogous to tree-level amplitudes. 
Approaches based on Feynman diagrams have also led to new results with six external 



partons, exemplified by the NLO cross section for producing ttbb at hadron colliders 



28]. 



We expect that on-shell methods will be particularly advantageous for processes involving 
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many 



31 



22, 



external gluons, which often dominate multi-jet final states. Various results [29|, |30, 



33j already indicate the suitability of these methods for a general-purpose numerical 



approach to high-multiplicity one-loop amplitudes. 

We recently presented the first NLO results for W + 3-jet production including all sub- 
processes (34]], using one-loop amplitudes obtained by on-shell methods. This study used 
a specific type of leading-color approximation designed to have small corrections — under 
3 percent, as verified in W + 1, 2-jet production — while reducing the required computer 
time. The study was performed for the Tevatron, with the same cuts employed by the CDF 
collaboration in their measurement of W + n-jet production 35[. The NLO corrections 
show a much-reduced dependence on the renormalization and factorization scales, and ex- 
cellent agreement with the CDF data for the distribution in the transverse energy E? of the 
third-most energetic jet. 

In the present paper, we continue our study of W + 3-jet production. We present results 
for W + 3-jet production at the LHC as well as at the Tevatron. As before, we include all 
subprocesses and take all quarks to be massless. (We do not include top-quark contributions, 
but expect them to be very small for the distributions we shall present.) We extend the 
previous results by including specific virtual contributions that are subleading in the number 
of colors, which we had previously neglected. We shall demonstrate that, as expected, these 
subleading-color corrections to cross sections and distributions are uniformly small, generally 
under three percent. We present three sets of distributions at the Tevatron: the Et of the 



third most energetic jet, the total transverse energy Ht [36(, and the di-jet invariant masses. 
These distributions are again computed with the same cuts used by CDF. (As discussed 
further in section 1111} we used the infrared-safe SIS Cone jet algorithm 37| in place of 
JETCLU, the cone algorithm used by CDF.) The code we use is general-purpose, permitting 
the analysis at NLO of any infrared-safe observable in W + 3-jet events. We also present a 
wide variety of distributions for the ultimate LHC energy of 14 TeV. We find that all the 
NLO cross sections and distributions display the expected reduction in renormalization- and 
factorization-scale dependence compared to the same quantities calculated at leading order. 

The shapes of distributions at leading order are quite sensitive to the functional form 
of the scale choice. As expected, the change in shape between LO and NLO distributions 
can be reduced by choosing typical energy scales event-by-event for the renormalization 
and factorization scales, as noted by many authors over the years Ij], Q, 38]. The vector 
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boson transverse energy 



and comparisons with data 



emp 



14 



34 



oyed as an event-by-event scale in previous predictions 



35 



391 ]. turns out to be a poor characterization of the 



typical energy scale for events with large jet transverse energies, as at the LHC We find that 
the total partonic transverse energy is a much better choice. Recently, similar deficiencies 
in the scale choice of Ejf at LO have been observed independently, and another variable, 
related to the invariant mass of the final-state jets, has been proposed as a replacement 40]. 
Here we go further and demonstrate that for LHC energies, is a poor scale choice not 
only at LO but also at NLO, yielding negative differential cross sections in the tails of some 
distributions. This pathology arises from large residual logarithms induced by disparities 
between momentum-transfer scales in multi-jet processes and the value of Ejf . 

For W + 3-jet production, choosing the total partonic transverse energy as the scale 
gives rise to shapes of distributions at LO that are typically similar to those at NLO. 
For a few W + 3-jet distributions genuine NLO effects are present, and significant shape 
changes remain between LO and NLO. These differences are usually less pronounced than 
in W + 1, 2-jet production. In the latter cases, the LO kinematics are more constrained, 
leading to significantly larger NLO corrections. In any event, an accurate description of the 
shape of any distribution requires an NLO computation, either to confirm that its shape is 
unmodified compared to LO, or to quantitatively determine the shape change. 

Ellis et al. have recently presented partial NLO results for W + 3-jet production. Their 
first calculation [4l| was restricted to leading-color contributions to two-quark subprocesses, 
rendering it unsuitable for phenomenological studies. Their version of the leading-color 
approximation drops subleading-color terms in both the virtual and real-emission contribu- 
tions. Quite recently 42| the same authors have added the leading-color contributions from 
four-quark processes, folding in the decay of the W in the zero-width approximation. They 
extended their leading-color approximation to include n/-dependent terms, and estimated 
the full-color result based on the leading-order ratio of the full-color (FC) and leading-color 
(LC) cross sections. The value of the double ratio ( (T nlo,fc/ .nlo > lC)^ (7 lo,fc/ (7 lo,lc) they 

use implicitly is quite sensitive to the inclusion of n/ terms, and as noted by the authors, 
sensitive to cancellations between the two-quark and four-quark contributions. It is nonethe- 
less interesting that their estimate for the total cross section is within a few percent of both 



our earlier result 



34| and the full-color one presented in this paper. It would be interesting 



to test their estimates for various distributions against the complete results presented here; 
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we leave such a comparison to future work. 

Next-to-leading order cross sections are built from several ingredients: virtual correc- 
tions, computed from the interference of tree-level and one-loop amplitudes; real-emission 
corrections; and a mechanism for isolating and integrating the infrared singularities in the 
latter. We evaluate the one-loop amplitudes needed for W + 3-jet production at NLO us- 
ing the BlackHat library 30J. This library implements on-shell methods for one-loop 



amplitudes numericall y. Re 
3 



grams 



29 



31 



32 



43 



ated methods have been implemented in several other pro- 
44j . A numerical approach to amplitudes requires attention to 
numerical instabilities induced by round-off error. We have previously verified BlackHat's 
stability for one-loop six-, seven- and eight-gluon amplitudes (3^], and for leading-color 
amplitudes for a vector boson with up to five partons 45j, using a flat distribution of 
phase-space points. In the present work, we confirm the stability of BLACKHAT-computed 
matrix elements for an ensemble of points chosen in the same way as in the actual numerical 
integration of the cross section. 

The real-emission corrections to the LO process arise from tree-level amplitudes with 
one additional parton: an additional gluon, or a quark-antiquark pair replacing a gluon. 
To isolate and cancel the infrared divergences that arise in the integration of these terms, 
we use the Catani-Seymour dipole subtraction method 46(, as implemented |47j] in the 
program AMEGIC+- 1- 4], itself part of the SHERPA framework [{J. (We also use 
AMEGICH — h for the required tree-level matrix elements.) Other automated implemen- 
tations of the dipole subtraction method have been presented recently 48]. 

The smallness of the subleading-color corrections to the specific leading-color approxi- 
mation employed in ref. 34| allows us to use a "color-expansion sampling" approach 49]. 
In this approach the subleading-color terms, while more time-consuming per phase-space 
point, are sampled with lower statistics than the leading-color ones, and therefore do not 
impose an undue burden on the computer time required. 

This paper is organized as follows. In section [III we summarize our calculational setup, 
and demonstrate the numerical stability of the one- loop matrix elements. In section III II we 
present results for the Tevatron, using the same experimental cuts as CDF. In section IIVI 
we discuss scale choices, showing that the choice of W transverse energy typically used for 
Tevatron studies can lead to significant distortions in the shapes of distributions over the 
broader range of kinematics accessible at the LHC We advocate instead the use of scale 
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choices that more accurately reflect typical energy scales in the process, such as the total 
partonic transverse energy, or a fixed fraction of it. In section |V] we present a wide variety 
of distributions for the LHC We highlight two particular topics in subsequent sections. 
Section [VI] examines properties of the leptons produced by W decay in W + 3 jet events. 
The different pseudorapidity distributions for electrons and positrons are presented. Then 
we show the ratios, between W + and W~ , of the transverse energy distributions for both 
the charged leptons and neutrinos. These two ratios have strikingly different behavior at 
large E T , presumably due to the effects of W polarization. In section IVHl we present results 
for the jet-emission probability, as a function of the pseudorapidity separation of the leading 
two jets. These results are relevant for searches for the Higgs boson in vector-boson fusion 
production. In section IVIIfl we discuss the specific leading-color approximation used in our 
previous study, and our approach to computing the subleading-color terms. We give our 
conclusions in section HXl Finally, in an appendix we give values of squared matrix elements 
at a selected point in phase space. 

II. CALCULATIONAL SETUP 

At NLO, the W + 3-jet computation can be divided into four distinct contributions: 

• the leading-order contribution, requiring the tree-level W + 5-parton matrix elements; 

• the virtual contribution, requiring the one-loop W + 5-parton matrix elements (built 
from the interference of one-loop and tree amplitudes); 

• the subtracted real-emission contribution, requiring the tree-level W + 6-parton matrix 
elements, an approximation capturing their singular behavior, and integration of the 
difference over the additional-emission phase space; 

• the integrated approximation (real-subtraction term), whose infrared- singular terms 
must cancel the infrared singularities in the virtual contribution. 

Each of these contributions must be integrated over the final-state phase space, after impos- 
ing appropriate cuts, and convoluted with the initial-state parton distribution functions. 
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We evaluate these different contributions using a number of tools. We compute the virtual 
corrections using on-shell methods, implemented numerically in BlackHat, as outlined 
below. The subtraction term is built using Catani-Seymour dipoles 46( as implemented 
in AMEGIC++ {4]. This matrix-element generator is part of the SHERPA package 
AMEGICH — h also provides our tree-level matrix elements. The phase-space integration 
is handled by SHERPA, using a multi-channel approach 50] . The SHERPA framework 
makes it simple to include various experimental cuts on phase space, and to construct and 
analyze a wide variety of distributions. With this setup, it is straightforward to make NLO 
predictions for any infrared-safe physical observable. We refer the reader to refs. 0, E,!^] for 
descriptions of AMEGIC+ + , SHERPA and the implementation of the Catani-Seymour 
dipole subtraction method. 



A. Subprocesses 

The W + 3-jet process, followed by leptonic W decay, 

Tevatron : pp ~) 

\ — ► W ± + 3 jets -> e ±, !7 e + 3 jets , (2.1) 
LHC : pp J 

receives contributions from several partonic subprocesses. At leading order, and in the 
virtual NLO contributions, these subprocesses are all obtained from 

q q'QQg^W ± ^e ± u e , (2.2) 
qq'ggg^W ± ^e ± 'v e , (2.3) 

by crossing three of the partons into the final state. The W couples to the q—q' line. We 
include the decay of the vector boson (W^) into a lepton pair at the amplitude level. The 
W can be off shell; the lepton-pair invariant mass is drawn from a relativistic Breit-Wigner 
distribution whose width is determined by the W decay rate IV- For definiteness we present 
results for W bosons decaying to either electrons or positrons (plus neutrinos). We take the 
leptonic decay products to be massless; in this approximation the corresponding results 
for fi (and r) final states are of course identical. Amplitudes containing identical quarks 
are generated by antisymmetrizing in the exchange of appropriate q and Q labels. The 
light quarks, u, d, c, s, b, are all treated as massless. We do not include contributions to the 
amplitudes from a real or virtual top quark; its omission should have a very small effect on 
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the overall result. Except as noted below, we use the same setup for the results we report 
for W + 1, 2-jet production. 



B. Color Organization of Virtual Matrix Elements 

To compute the production of W + 3 jets at NLO, we need the one-loop amplitudes 
for the processes listed in eqs. ( 12. 2p and (12. 3p . Amplitudes in gauge theories are naturally 
decomposed into a sum over permutations of terms; each term is the product of a color 
factor and a color-independent kinematic factor called a partial or color-ordered amplitude. 
It is convenient to decompose the one-loop amplitudes further, into a set of primitive am- 



plitudes [19j, |51|. These are the basic gauge-invariant building blocks of the amplitude, in 
which the ordering of all colored external legs is fixed, the direction of fermion lines within 
the loop is fixed, and rif terms arising from fermion loops are separated out. In Black- 
Hat, the primitive amplitudes are computed directly using the on-shell methods reviewed 
in the next subsection. The primitive amplitudes are then combined to obtain the partial 
amplitudes. The virtual contributions are assembled by summing over interferences of the 
one-loop partial amplitudes with their tree-level counterparts. 

In organizing the amplitude, it is useful to keep the numbers of colors N c and of flavors rif 
as parameters, setting them to their Standard-Model values only upon evaluation. Matrix 
elements, whether at tree level or at one loop, can be organized in an expansion dictated 
by the N c — > oo limit. In this expansion, the standard "leading-color" contribution is 
the coefficient of the leading power of N c , and "subleading-color" refers to terms that are 
suppressed by at least one power of either 1/N%, or Uf/N c from virtual quark loops. (The 
expansion in either quantity terminates at finite order, so if all terms are kept, the result is 
exact in 1/N C .) 

Only one primitive amplitude contributes at leading order in 1/N C to each leading-color 
partial amplitude. Fig. [1] shows sample "parent" color-ordered Feynman diagrams for the 
leading-color primitive amplitudes needed for W + 3-jet production. Other diagrams con- 
tributing to a given primitive amplitude have fewer propagators in the loop. They can be 
obtained from the diagrams shown by moving vertices off of the loop onto trees attached 
to the loop, or by using four-gluon vertices, while preserving the cyclic (color) ordering of 
the external legs and the planar topology of the diagram. In the leading-color primitive 
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FIG. 1: Representative diagrams contributing at leading order in an expansion in the number of 
colors to the qg — ► evq'gg and qQ — > ev q'gQ one-loop amplitudes. The ev pair couples to the 
quarks via a TV boson. 




FIG. 2: Representative diagrams contributing only at subleading order in an expansion in the 
number of colors to the qg — > ev q'gg and qQ — > ev q'gQ one- loop amplitudes. In such contributions, 
either an external gluon, or a gluon splitting to a QQ pair, is emitted from the q-q' line, between 
the W boson and one of the external quarks, q or q' , in the cyclic ordering of the external legs. 

amplitudes, the W boson is between the q and q' external legs, with no other partons in 
between. 

In subleading-color terms, a greater number and variety of primitive amplitudes appear, 
and some primitive amplitudes contribute to more than one subleading-color partial ampli- 
tude. A few of the parent diagrams for subleading-color primitive amplitudes are shown in 
fig. [2j In such diagrams, either another parton appears between the W boson and either 
q or q', or a gluon is emitted between Q and Q in process ( 12.21) . or the diagram contains 
a closed fermion loop. In the present paper, we include all subleading-color contributions. 
In section IVIII[ we discuss in greater detail how to evaluate the full virtual cross section 
efficiently, by taking advantage of the smallness of the subleading-color contributions. 
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C. On-Shell Methods 



The computation of one-loop partonic amplitudes has presented until recently a bottle- 
neck to NLO predictions for hadronic production of four or more final-state objects (jets 

included). The on-shell method has broken this bottleneck. This approach is based on 

□ 

the unitarity method [20(, including its newer refinements, together with on-shell recursion 



relations 



521 ] at one loop [53j. The refinements 



54 



55 



561 . 157] rely on the use of com- 



plex momenta, generalized unitarity and the analytic structure of integrands, as well as 

subtractions to make efficient use of the known basis of one-loop integrals. The one-loop 

matrix elements [3] used by the MCFM program 22] for NLO predictions of W + 2-jet 

production were computed analytically using an early version of this approach, and indeed 

I I 

served to introduce the use of generalized unitarity [58] as an efficient technique for loop 
computations. As applied to hadron colliders, these matrix elements have three final-state 
objects. Feynman-diagram calculations have also reached into this domain [9]. Beyond this, 



improved integral reduction techniques [59 



matrix elements for four final-state objects 



have even made possible the computation of 



28 



601 ] and NLO predictions using them. 



Nonetheless, textbook Feynman-diagrammatic approaches suffer from a factorial increase 
in complexity (or exponential if color ordered) and increasing degree of tensor integrals, with 
increasing number of external legs. The unitarity method for one-loop amplitudes, in con- 
trast, can be cast in a form with only a polynomial increase in complexity per color-ordered 



helicity configuration [3 



31 



6Jj . This suggests that it will have an increasing advantage 



with increasing jet multiplicity. At fixed multiplicity, on-shell methods gain their improved 
efficiency by removing ab initio the cancellation of gauge- variant terms, and eliminating the 
need for tensor-integral (or higher-point integral) reductions. The problem is reduced to 
the computation of certain rational functions of the kinematic variables, to which efficient 
tree-like techniques can be applied. On-shell methods have also led to a host of analytic 
results, including one-loop amplitudes in QCD with an arbitrary number of external legs, for 



particular he 
in refs. Q, E 



i city assignments [53 



611 ] . The reader may find reviews and further references 



62]. 



The BlackHat library implements on-shell methods for one-loop amplitudes numeri 



cally. We have described the computation of amplitudes using BlackHat elsewhere 



30 



45]. 



We limit ourselves here to an overview, along with a discussion of new features that arise 
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when we include subleading-color contributions to the cross section. 

Any one-loop amplitude can be written as a sum of terms containing branch cuts in 
kinematic invariants, C n , and terms free of branch cuts, R n , 



A n — C n + R n 



(2.4) 



The cut part C n can in turn be written as a sum over a basis of scalar integrals 63j 



(2.5) 



The scalar integrals -^34 " " bubbles, triangles, and boxes - - are known functions [64]. 
They contain all the amplitude's branch cuts, packaged inside logarithms and dilogarithms. 
(Massive particles propagating in the loop would require the addition of tadpole contribu- 



tions.) We 



method 



;ake all external momenta to be four dimensional. Following the spinor-helicity 
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651 ]. we can then re-express all external momenta in terms of spinors. The 



coefficients of these integrals, 6«, q, and di, as well as the rational remainder R n , are then all 
rational functions of spinor variables, and more specifically of spinor products. The prob- 
lem of calculating a one-loop amplitude then reduces to the problem of determining these 
rational functions. 

Generalized unitarity improves upon the original unitarity approach by isolating smaller 
sets of terms, hence making use of simpler on-shell amplitudes as basic building blocks. 
Furthermore, by isolating different integrals, it removes the need for integral reductions; 
and by computing the coefficients of scalar integrals directly, it removes the need for tensor 
reductions. Britto, Cachazo and Feng {54] showed how to combine generalized unitarity with 
a twistor-inspired 66j use of complex momenta to express all box coefficients as a simple sum 



of products of tree amplitudes. Forde 



57| showed how to extend the technique to triangle and 



bubble coefficients. His method uses a complex parametrization and isolates the coefficients 
at specific universal poles in the complex plane. It is well suited to analytic calculation. 
Upon trading series expansion at infinity for exact contour integration via discrete Fourier 
summation [30|, the method can be applied to numerical calculation as well, where it is 
intrinsically stable. Generalized unitarity also meshes well with the subtraction approach to 
integral reduction introduced by Ossola, Papadopoulos and Pittau (OPP) [55]. As described 
in ref. (3^], in BlackHat we use Forde's analytic method, adapted to a numerical approach. 
We evaluate the boxes first, then the triangles, followed by the bubbles; the rational terms are 
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computed separately. For each term computed by cuts, we enhance the numerical stability 
of Forde's method by subtracting prior cuts. This is similar in spirit to, though different 
in details from, one aspect of the OPP approach, in which all prior integral coefficients are 
subtracted at each stage. 

The terms R n , which are purely rational in the spinor variables, cannot be computed using 
four- dimensional unitarity methods. At present, there are two main choices for computing 
these contributions within a process-nonspecific numerical program: on-shell recursion, and 



.D-dimensional unitarity. Loop level on-shell recursion 53 
on-shell recursion of Britto, Cachazo, Feng and Witten 



61| is based on the tree-level 



unitarity 



43 



44], 



62 



68 



521 ] . The utility of D-dimensional 



69( | grows out of the original observation [70| by van Neer 



ven that dispersion integrals in dimensional regularization have no subtraction ambiguities. 
Accordingly the unitarity method in D dimensions retains all rational contributions to ampli- 
671 ]. This version of unitarity, in which tree amplitudes are evaluated in D dimensions, 



tudes 



has been used in various analytic 68] and numerical 29 



31 



32 



33, 



43, 



44 



69 



71] studies. 



We have implemented on-shell recursion in BlackHat, along with a "massive continuation" 
approach — related to D-dimensional unitarity — along the lines of Badger's method 72]. 
We speed up the on-shell recursion by explicitly evaluating some spurious poles analytically. 
Both approaches are used for our evaluation of the W + 3-jet virtual matrix elements. For 
producing the plots in this paper, we use on-shell recursion for the computation of primitive 
amplitudes with all negative helicities adjacent. These amplitudes have a simple pattern of 
spurious poles |6l| (poles which cancel between the cut part C n and rational part Rn). For 
them, on-shell recursion is faster than massive continuation in the present implementation. 
BlackHat's use of four-dimensional momenta allows it to rely on powerful four- 



dimensional spinor techniques 



12 



65 



731 ] to express the solutions for the loop momenta 



in generalized unitarity cuts in a numerically stable form 30|. In the computation of the 
rational terms using on-shell recursion, it also allows convenient choices for the complex mo- 
mentum shifts. In four dimensions one can also employ simple forms of the tree amplitudes 
that serve as basic building blocks. While spinor methods arise most naturally in amplitudes 
with massless momenta, it is straightforward to include uncolored massive external states 
such as the W boson [12]]. The methods are in fact quite general, and can also be applied 



usefully to one-loop amplitudes with internal massive particles, or external massive ones 
such as top quarks (treated in the narrow-width approximation) 

HQ- 



is 



With the current version of BlackHat, the evaluation of a complete helicity-summed 
leading-color virtual interference term for a two-quark partonic subprocess (12.31) . built out 
of all the primitive amplitudes, takes 530 ms on average for each phase-space point, on a 
2.33 GHz Xeon processor. The evaluation of a complete four-quark partonic subprocess (12. 2p 
with distinct quarks takes 185 ms (identical quarks take twice as long). The mix of sub- 
processes leads to an evaluation time of 470 ms on average for each phase-space point. (As 
described in section III in performing the phase-space integration we sample a single sub- 
process at each point.) Using the "color-expansion sampling" approach we shall discuss 
in section IVIIIl evaluating the subleading-color contributions would multiply this time by 
about 2.4, giving an average evaluation time of 1.1 s for the full color calculation. 



D. Numerical Stability of Virtual Contributions 

BlackHat computes matrix elements numerically using on-shell methods. In certain 
regions of phase space, particularly near the vanishing loci of Gram determinants associated 
with the scalar integrals I\ 3 4 , there can be large cancellations between different terms in 
the expansion (I2.5P of the cut part C n , or between the cut part and the rational part R n 
in eq. (12 .4p . There can also be numerical instabilities in individual terms. For example, 
the recursive evaluation of R n includes a contribution from residues at spurious poles in 
the complex plane. These residues are computed by sampling points near the pole, in an 
approximation to a contour integral which can be spoiled if another pole is nearby. 

In normal operation, BlackHat performs a series of tests to detect any unacceptable 
loss of precision. Whenever BlackHat detects such a loss, it re-evaluates the problematic 
contributions to the amplitude (and only those terms) at that phase-space point using higher- 



precision arithmetic (performed by the QD package [75j)- This approach avoids the need to 
analyze in detail the precise origin of instabilities and to devise workarounds for each case. 
It does of course require that results be sufficiently stable, so that the use of higher precision 
is infrequent enough to incur only a modest increase in the overall evaluation time; this is 
indeed the case. 

The simplest test of stability is checking whether the known infrared sing ularity of a 
given matrix element has been reproduced correctly. As explained in ref. [30], this check 
can be extended naturally to check individual spurious-pole residues. Another test checks 
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the accuracy of the vanishing of certain higher-rank tensor coefficients. From the interaction 
terms in the (renormalizable) QCD Lagrangian we know on general grounds which high-rank 
tensor coefficients have to vanish. All tensors with rank greater than m must vanish, for the 
m-point integrals with m — 2, 3, 4. If the integral corresponds to a cut line that is fermionic, 
then the maximum rank is reduced by one. In our approach the values of the higher-rank 
tensor coefficients may be computed without much extra cost in computation time. For a 
given generalized unitarity cut, when using complex loop- momentum parametrizations along 
the lines of ref. 30], these tensor coefficients appear as coefficients of specific monomials in 



the complex parameters. Their values may be extracted as a byproduct of evaluating the 
scalar integral coefficients. Similarly, in the massive continuation approach to computing 
the rational terms, particular tensor coefficients can be associated with specific monomials 
in the complex parameters and in an auxiliary complex mass parameter entering the loop- 
momentum parametrization. 

We apply the latter check when computing coefficients of scalar bubble integrals, as well 
as bubble contributions to the rational terms in the massive continuation approach. The 
value of this check is twofold. Firstly, it focuses on a small part of the computation, namely 
single bubble coefficients. This allows BlackHat to recompute at higher precision just the 
numerically- unstable contributions, instead of the entire amplitude. By contrast, the above- 
mentioned check of the infrared singularity assesses the precision of the entire cut part C n 
of the given primitive amplitude, and so it requires more recomputation if it fails. Secondly, 
the check applied to the bubble contributions in the massive continuation approach assesses 
the precision of the rational part R n , which is inaccessible to the infrared-singularity check. 

Finally, a further class of tests of numerical precision looks for large cancellations between 
different parts of A n , in particular between C n and R n in eq. (|2.4p . 

We have previously assessed the numerical stability of BlackHat for six-, seven- and 



eight-gluon one-loop amplitudes 



30] , as well as for the leading-color amplitudes for a vector 



boson with up to five partons used in the present study. These earlier studies used a 
flat phase-space distribution. Here we show the stability of BlackHat over phase-space 
points selected in the same way as in the computation of cross sections and distributions. 
As will be discussed in section III F\ the phase-space points are selected using an integration 
grid that has been adapted to the leading-order cross section. 

In fig. [3J we illustrate the numerical stability of the leading-color virtual interference term 
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FIG. 3: The distribution of the relative error in the leading-color virtual cross section for two 
subprocesses, gd — > e~Dddu and gd — > e~9ggu. The phase-space points are selected in the same 
way as those used to compute cross sections at the LHC. The horizontal axis is the logarithm of 
the relative error f|2.6|) between an evaluation by BlackHat, running in production mode, and 
a target expression evaluated using higher precision with at least 32 decimal digits (or up to 64 
decimal digits for unstable points). The vertical axis shows the number of events out of 100,000 
with the corresponding error. The dashed (black) line shows the 1/e 2 term; the solid (red) curve, 
the 1/e term; and the shaded (blue) curve, the finite (e°) term. 



(or squared matrix element), day, summed over colors and over all helicity configurations 
for two subprocesses, gd — ► e~vddu and gd — ^ e~vggu. (The grid here has been adapted 
to each of the subprocesses individually, instead of to the sum over subprocesses.) We have 
checked that the other subprocesses are similarly stable. The horizontal axis of fig. [3] shows 
the logarithmic error, 

. (2.6) 



log 



\da v um 



10 



I j target i 

\da x 



•v I / 

for each of the three components: 1/e 2 , 1/e 1 and e°, where e = (4— D)/2 is the dimensional 
regularization parameter. The targets have been computed by BlackHat using multipreci- 
sion arithmetic with at least 32 decimal digits, and 64 if the point is deemed unstable. The 
overwhelming majority (99.9%) of events are computed to better than one part in 10 4 - 
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that is, to the left of the '—4' mark on the horizontal axis. 

We have also examined distributions in which each bin is weighted by the requisite squared 
matrix element and Jacobian factors. We find that they have quite similar shapes to the 
unweighted distributions shown in fig. [3J This implies that the few events with a relative 
error larger than 10~ 4 make only a small contribution to the total cross section. We have 
verified that the difference between normal and high-precision evaluation in the total cross 
section, as well as bin by bin for all distributions studied, is at least three orders of magnitude 
smaller than the corresponding numerical integration error. 



E. Real-Emission Corrections 






FIG. 4: Representative diagrams for the eight-point tree-level amplitudes, qg — > euq'ggg, qg — > 
euq'gQQ, and qq' — ► ev QiQiQ2Q2- The ev pair couples to the quarks via a W boson. 

In addition to the virtual corrections to the cross section provided by BlackHat, an 
NLO calculation also requires the real-emission corrections to the LO process. These terms 
arise from tree-level amplitudes with one additional parton: an additional gluon, or a quark- 
antiquark pair replacing a gluon. Representative real-emission diagrams are shown in fig. HI 
Infrared singularities develop when the extra parton momentum is integrated over phase- 
space regions unresolved by the jet algorithm or jet cuts. The resulting singular integrals 
cancel against singular terms in the virtual corrections, and against counter-terms associated 
with the evolution of parton distributions. As mentioned above, to carry out these cancel- 



lations, we use the Catani-Seymour dipole subtraction method 461 as implemented |47J in 

„_ r .4 SHfr a. 

mentation of dipole subtraction has already been tested [47j in explicit comparisons against 
the DISENT program [76|. 



The implementation introduces two free parameters, a cu t and adi po ie- The first, a cut , 
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parametrizes the volume of phase space to be cut out around the soft or collinear singularity. 
From an analytic point of view, a cut could be taken to zero, as the cancellation of counter- 
terms against the matrix element's singularities is exact. In numerical implementations, 
however, round-off error can spoil this cancellation. Previous studies have shown that the 
final result is independent of this cut-off parameter once it is sufficiently small {^J . We use 
«cut = 10" 8 . 

The second parameter, adipoie, characterizes a common modification of subtraction terms 



in phase space away from the singularity |77| , restricting the support of a given subtraction 
term to the vicinity of its singularity. This allows the program to compute only a subset of 
dipole terms, as many will now be identically zero at a given phase-space point. Because 
the number of dipole terms is large (scaling as m 3 for processes containing m partons), 
this reduces the computational burden considerably. We run our code with several different 
values of ctdipoie, and check the independence of the final result on the value of adipoie (0 < 
ctdipoie < 1)- For example, the LHC W~ results for adipoie = 0.03 agree with those for 
c^dipoie = 0.01 to better than half a percent, within the integration errors. We have also 
run a large number of other lower-statistics checks demonstrating that cross sections are 
independent of the choice of adi po ie- Our default choice for the LHC is adi po ie = 0.03, while 
for the Tevatron it is ctdipoie = 0.01. 

F. Phase-Space Integration 

Along with the automated generation of matrix elements and dipole terms, SHERPA 
also provides Monte Carlo integration methods. The phase-space generator combines a priori 
knowledge about the behavior of the integrands in phase space with self-adaptive integration 



methods. It employs a multi-channel method in the spirit of ref. 50]. Single channels (phase- 
space parametrizations) are generated by AMEGIC-) — \- together with the tree-level matrix 
elements. Each parametrization reflects the structure of a Feynman amplitude, roughly 
reproducing its resonances, decay kinematics, and its soft and collinear structure. The most 
important phase-space parametrizations, determined by the adapted relative weight within 



the multi-channel setup, are further refined using Vegas 781 ] . 

The phase-space optimization (adaptation of channel weights and Vegas grids) is per- 
formed in independent runs before the actual computation starts. The optimization is done 
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on the sum of all contributing parton-level processes. We refer collectively to all the parame- 
ters of the optimization as the integration grid. Separate integration grids are constructed for 
the LO terms and for the real-emission contributions. To integrate the virtual contributions, 
we re-use the grid constructed for the LO terms. This procedure avoids the computational 
expense of evaluating the virtual terms merely for grid construction. The virtual to LO 
ratio is sufficiently flat across phase space that this results in only a slight inefficiency when 
evaluating distributions. 

Following the initialization phase, the integration grids are frozen. In the ensuing pro- 
duction phase, we sample over subprocesses so that only a single parton-level subprocess is 
evaluated per phase-space point, selected with a probability proportional to its contribution 
to the total cross section. We choose to integrate the real-emission terms over about 10 8 
phase-space points, the leading-color virtual parts over 2 x 10 6 phase-space points and the 
subleading-color virtual parts over 10 5 phase-space points. The LO and real-subtraction 
pieces are run separately with 10 7 points each. These numbers are chosen to achieve a total 
integration error of half a percent or less. For a given choice of scale /i, they give comparable 
running times for the real-emission and virtual contributions. Running times for leading- 
and subleading-color virtual contributions are also comparable. 

G. Couplings and Parton Distributions 

We work to leading order in the electroweak coupling and approximate the Cabibbo- 
Kobayashi-Maskawa (CKM) matrix by the unit matrix. This approximation causes a rather 
small change in total cross sections for the cuts we impose, as estimated by LO evaluations 
using the full CKM matrix. At the Tevatron, the full CKM results are about one percent 
smaller than with the unit CKM matrix; the difference is even smaller at the LHC. We 
express the jy-boson couplings to fermions using the Standard Model input parameters 
shown in table [H The parameter is derived from the others via, 



We use the CTEQ6M [79] parton distribution functions (PDFs) at NLO and the 
CTEQ6L1 set at LO. The value of the strong coupling is fixed accordingly, such that 
a s {M z ) = 0.118 and a s {M z ) = 0.130 at NLO and LO respectively. We evolve a s (/x) 



9w = 



47T«Q ED (M Z ) 

sin 2 W 



(2.7) 
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parameter 


value 


oqed(Mz) 


1/128.802 


M w 


80.419 GeV 


sin 2 W 


0.230 


Tw 


2.06 GeV 


al 


0.4242 (calculated) 



TABLE I: Electroweak parameters used in this work. 

using the QCD beta function for five massless quark flavors for /j, < m t , and six flavors 
for fi > m t . (The CTEQ6 PDFs use a five-flavor scheme for all /i > m^, but we use the 
SHERPA default of six-flavor running above top-quark mass; the effect on the cross section 
is very small, on the order of one percent at larger scales.) At NLO we use two- loop running, 
and at LO, one-loop running. 

H. Kinematics and Observables 

As our calculation is a parton-level one, we do not apply corrections due to non- 
perturbative effects such as those induced by the underlying event or hadronization. CDF 
has studied [35j these corrections at the Tevatron, and found they are under ten percent 
when the n th jet is below 50 GeV, and under five percent at higher E?- 

For completeness we state the definitions of standard kinematic variables used to charac- 
terize scattering events. We denote the angular separation of two objects (partons, jets or 
leptons) by 

Ai?=^/(A0)2 + (Ar / )2, (2.8) 

with A0 the difference in the azimuthal angles, and Arj the difference in the pseudorapidities. 
The pseudorapidity rj is given by 

, ( 6 

77 = — in tan - 

V 2 

where 9 is the polar angle with respect to the beam axis. 
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The transverse energies of massless outgoing partons and leptons, Et = ypl + Py, can 
be summed to give the total partonic transverse energy, H T , of the scattering process, 

H t = Y, E t + e t + e t- (2.10) 

v 

All partons p and leptons are included in Ht, whether or not they are inside jets that pass 
the cuts. We shall see in later sections that the variable Ht represents a good choice for the 
renormalization and factorization scale of a given event. Although the partonic version is 
not directly measurable, for practical purposes as a scale choice, it is essentially equivalent 
(and identical at LO) to the more usual jet-based total transverse energy, 

H T = Y, E T,j + E e T + E T . (2.11) 

j 

The partonic version Ht has the advantage that it is independent of the cuts; thus, loosening 
the cuts will not affect the value of the matrix element, because a renormalization scale of 
Ht will be unaffected. On the other hand, we use the jet-based quantity Ht, which is 
defined to include only jets passing all cuts, to compute observable distributions. Note that 
for W + n-jet production at LO, exactly n jets contribute to eq. (12. lift ; at NLO either n or 
n + 1 jets may contribute. 

The jet four-momenta are computed by summing the four-momenta of all partons that 
are clustered into them, 

pjT=£p*- (2-12) 

The transverse energy is then defined in the usual way, as the energy multiplied by the 
momentum unit vector projected onto the transverse plane, 

E^ = £ jet sinfl jet . (2.13) 

The total transverse energy as defined in eq. (12. lip is intended to match the experimental 
quantity, given by the sum, 



H? p = e t, 3 +E T + $ T , (2. 14) 

3 

where $ T is the missing transverse energy. Jet invariant masses are defined by 

M*=(pr+p?) 2 , (2-15) 
21 



and the jets are always labeled i,j = 1, 2, 3, ... in order of decreasing transverse energy E r p ) 
with 1 being the leading (hardest) jet. The transverse mass of the PF-boson is computed 
from the kinematics of its decay products, W — > eu e , 



Mf = J2E%.E%.(1 - cos(A0 ei ,)) . (2.16) 



I. Checks 



We have carried out numerous checks on our code, ranging from checks of the basic prim- 
itive amplitudes in specific regions of phase space to overall checks of total and differential 
distributions against existing codes. We have compared our results for the total cross section 
for W + 1, 2- jet production (at a fixed scale \x = Myy) with the results obtained from running 
MCFM |22j. Because the publicly available version of MCFM does not allow a cut in M$ 



we eliminated this cut in the comparison, 
used in the latter code obtained from ref. 



We had previously compared the matrix elements 



19( | , to the results produced purely numerically in 



BlackHat.) Agreement at LO and NLO for W + 1-jet production at the LHC is good to 
a per mille level. For W + 2-jet production, at LO we find agreement with MCFM within a 
tenth of a percent, while at NLO, where the numerical integration is more difficult, we find 
agreement to better than half a percent 1 . We find the same level of agreement at NLO at 
the Tevatron, using a different set of cuts 2 . 

We have carried out extensive validations of our code at a finer-grained level. We have 
confirmed that the code reproduces the expected infrared singularities (poles in e) for the 



primitive amplitudes and the full color-dressed one- loop amplitudes [46j, l80( . We have also 
confirmed that the poles in e in the full virtual cross section cancel against those found in 
the integrated real-subtraction terms 0|. 

We checked various factorization limits, both two-particle (collinear) and multi-particle 
poles. These factorization checks are natural in the context of on-shell recursion. This 
method constructs the rational terms using a subset of the collinear and multi-particle 



1 This level of agreement holds only for the most recent MCFM code, version 5.5. We thank John Campbell 
and Keith Ellis for assistance with this comparison. Here we matched MCFM by including approximate 
top-quark loop contributions, as given in ref. [l9|], and we adopted MCFM's electroweak parameter 
conventions. 

2 In performing this comparison, we used a previous version of MCFM. The differences between the two 
versions at the Tevatron should be minor. 
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factorization poles; the behavior in other channels constitutes an independent cross check. 
For the leading-color primitive amplitudes, we verified that all factorization limits of the 
amplitudes are correct. (We also checked that all spurious poles cancel.) For the subleading- 
color primitive amplitudes, we verified the correct behavior as any two parton momenta 
become collinear. We also checked at least one collinear limit for each partial amplitude. 
We had previously computed the leading-color amplitudes for the subprocess (12.31) in 



ref. UM. Ellis et al. 



71] confirmed these values, and also computed the subleading-color 



primitive amplitudes. This evaluation used D-dimensional generalized unitarity 
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a decomposition of the processes in eqs. (12.21) and ( 12. 3ft into primitive amplitudes 



63 
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68], 
5l|, 



and the OPP formalism for obtaining coefficients of basis integrals [55|. We have compared 
the subleading-color primitive amplitudes at a selected phase-space point to the numerical 
values reported in ref. [7ll . and find agreement, up to convention-dependent overall phases. 



Van Hameren, Papadopoulos, and Pittau (HPP) recently computed [33J] the full helicity- 
and color-summed virtual cross section for the subprocess ud — > W + ggg at another phase- 
space point, for an undecayed on-shell W boson and including (small) virtual top-quark 
contributions. They used the OPP formalism and the CutTools 29] and HELAC-1L js], 



331 ] codes. We have compared the full squared matrix element to the result produced by the 
HPP code, with the top-quark contributions removed 3 . We find agreement with their value 
of the ratio of this quantity to the LO cross section 4 . We have also found agreement with 
matrix-element results from the same code, allowing the W boson to decay to leptons, as in 
our setup. We give numerical values of the squared matrix elements for an independent set 
of subprocesses, evaluated at a different phase-space point, in an appendix. 

As mentioned earlier, we verified that the computed values of the virtual terms are 
numerically stable when integrated over grids similar to those used for computing the cross 
section and distributions. We also checked that our integrated results do not depend on 

r 

«di P oie, the unphysical parameter controlling the dipole subtraction [77J], within integration 
uncertainties. 



3 We thank Costas Papadopoulos and Roberto Pittau for providing us with these numbers. 

4 We can recover an undecayed W by integrating over the lepton phase-space; that integral in turn can be 
done to high precision by replacing it with a discrete sum over carefully-chosen points. 
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III. TEVATRON RESULTS 



In this section we present next-to-leading order results for W + 3-jet production in pp 
collisions at =1.96 TeV, the experimental configuration at the Tevatron. We decay the 
W bosons into electrons or positrons (plus neutrinos) in order to match the CDF study 35]. 



In our earlier Letter 



341 ]. we presented results for the third jet's transverse energy (Eq 



distribution as well as the total transverse energy (Hr) distribution. Those calculations 
employed a particular leading-color approximation for the virtual terms [34J . As discussed in 
section IVIIIt this approximation is an excellent one, accurate to within three percent. In the 
present paper, we give complete NLO results for a larger selection of distributions, including 
all subleading-color terms. It would be interesting to compare the new distributions with 
experimental results from both CDF and DO, as they become available. 
We use the same jet cuts as in the CDF analysis [351 ] . 



E^ > 20GeV, 



(3.1) 



Following ref. 



351 ] . we quote total cross sections using a tighter jet cut, E^ > 25 GeV. We 



order jets by E T . Both electron and positron final states are counted, using the same lepton 
cuts as CDF, 



E e T > 20 GeV , 
$ T > 30 GeV , 



\v e \ < i-i, 

Mjf > 20 GeV. 



(3.2) 



(We replace the $ T cut by one on the neutrino E^.) CDF also imposes a minimum AR 
between the charged decay lepton and any jet; the effect of this cut, however, is undone by 
a specific acceptance correction 81] • Accordingly, we do not impose it. 

For the LO and NLO results for the Tevatron we use an event-by-event common renor- 
malization and factorization scale, set equal to the W boson transverse energy, 



(3.3) 



To estimate the scale dependence we choose five values: u/2, fi/ \/2, /i, a/2/z, 2/j,. 

The CDF analysis used the JETCLU cone algorithm 82] with cone radius R = 0.4. This 
algorithm is not generally infrared safe at NLO, so we use the seedless cone algorithm SIS- 



CONE 



371 ] instead. Like other cone-type algorithms, SIS Cone gives rise to jet-production 
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number of jets 


CDF 


LO 


NLO 


1 


53.5 ±5.6 


41.40(0.02)^;^ 


57.83(0.12)^;oo 


2 


6.8 ±1.1 


6.159(0.004)1^ 


7.62(0.04)1^2 


3 


0.84 ±0.24 


0.796(0.001)1^ 


0.882(0.005)lg;?38 



TABLE II: Total inclusive cross sections, in pb, for W + n jets produced at the Tevatron with 



35J . The first column gives the 



W -» ev and £™ th ~ jet > 25 GeV, using the experimental cuts of ref. 
experimental results as measured by CDF. The experimental statistical, systematic and luminosity 
uncertainties have been combined in quadrature. The second column shows LO results, and the 
third column the complete NLO results. In each case, the scale dependence is quoted in super- 
and subscripts and the numerical integration uncertainties in parentheses. 

cross sections that can depend on an overlap threshold or merging parameter, here called 
/. No dependence on / can develop at LO, because such dependence would require the 
presence of partons in the overlap of two cones. The W + 1-jet production cross section 
likewise cannot depend on / at NLO. We set this parameter to 0.5. (Unless stated otherwise 
we take this algorithm and parameter choice as our default.) 

We expect similar results at the partonic level from any infrared-safe cone algorithm. For 
W + 1,2-jet production we have confirmed that distributions using SISCone are within a 



few percent of those obtained with MCFM using the midpoint cone algorithm [83j. (The 
midpoint algorithm is infrared-safe at NLO for W + 1, 2-jet production, but not for W + 3- 
jet production [37J.) The algorithm dependence of W + 3-jet production at the Tevatron at 



42] 



NLO has also been discussed recently by Ellis et al. 

In table [III we collect the results for the total cross section, comparing CDF data to the 
LO and NLO theoretical predictions computed using BlackHat and SHERPA. In both 
cases these are parton-level cross sections. Results from more sophisticated ("enhanced") 
LO analyses incorporating parton showering and matching schemes {?], [si, Is4| may be found 



in ref. 35j; however, large scale dependences still remain. (These calculations make differ- 
ent choices for the scale variation and are not directly comparable to the LO parton-level 
predictions given here.) As in the experimental analysis, we sum the W~ and W + cross 
sections, which are identical at the Tevatron (for forward-backward symmetric acceptance 
cuts). 
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£,3rd-jet 


da/dE T ld ' iet (pb/GeV) 




CDF 


LO 


NLO 


20-25 


0.184 ±0.0394 


n -,01 +0.0769 
U.±Ol_ Q443 


n 1 60+ - 0205 

U.1DU_ 0277 


25-30 


0.087 ± 0.0268 


66+ - 0393 

U.UOD_ 0224 


n n77+ - 0075 

u - u ' ' -0.0126 


30-35 


0.037 ±0.0153 


o n3fi+ - 0216 

U.UJD_ 0123 


n 041 + - 0036 

U.U41_q 006 g 


35-45 


0.020 ±0.0125 


01 7 +0.0103 
u - ul( -0.0058 


n m s+o.0009 
U.U18_ 00027 


45-80 


0.0015 ±0.00177 


o nns2 +0 - 00207 

u.uuoz_ .00114 


n nnoi +0.00015 
U.UU,31_o. 00041 



TABLE III: The differential cross sections, da(pp — ► eu+ > 3-jets) / dE T rA ■' et , for W + 3-jet produc- 
tion at the Tevatron using the experimental cuts (|3.ip and f|3.2|) of ref. [35]. This table corresponds 
to the values plotted in fig. [5j 



We have also computed the W + 2-jet and W + 3-jet total cross sections at NLO with a 
larger merging parameter, / = 0.75. (CDF uses a value of / = 0.75 35], but for a different, 
infrared- unsafe algorithm, JETCLU.) The value of the NLO W + 3-jet production cross 
section of 0.882 pb in table [III then increases to 0.917 pb (about 4%). The W + 2-jet 
production cross section shows a more modest increase from 7.62 pb to 7.69 pb (about 1%). 
Distributions, as for example the ones shown in fig. [5] (see also table IIII|) . follow a similar 
bin-by-bin dependence on /. 

In fig. EL we compare the Et distribution of the second- and third-most energetic jets in 
CDF data 35| to the NLO predictions for W + 2-jet and W + 3-jet production, respectively. 
For convenience, in table II III we collect the data used to construct the third-jet Et plot in 
fig. [51 We include scale-dependence bands obtained as described above. 5 The experimental 
statistical and systematic uncertainties (excluding an overall luminosity uncertainty of 5.8%) 
have been combined in quadrature. The upper panels of fig. [5] show the distribution itself, 
while the lower panels show the ratio of the LO value and of the data to the NLO result for 
the central value of fi = Ejf . Note that we normalize here to the NLO result, not to LO 
as done elsewhere. The LO/NLO curve in the bottom panel represents the inverse of the 



5 We emphasize that the scale- uncertainty bands are only rough estimates of the theoretical error, which 
would properly be given by the difference between an NLO result and one to higher order (next-to-next- 
to-leading order). 
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FIG. 5: The measured cross section, da(pp — * eu+ > n-jets)/<i£ ; ^ th •' et , for inclusive W + n- 
jet production, compared to full NLO predictions for n = 2,3. In the upper panels the NLO 
distribution is the solid (black) histogram, and CDF data points are the (red) points, whose inner 
and outer error bars, respectively, denote the statistical and total uncertainties (excluding the 
luminosity error) on the measurements added in quadrature. The LO predictions are shown as 
dashed (blue) lines. The thin vertical lines in the center of each bin (where visible) give the 
numerical integration errors for that bin. Each lower panel shows the distribution normalized to 
the full NLO prediction, using the CDF experimental bins (that is, averaging over bins in the upper 
panel). The scale-dependence bands are shaded (gray) for NLO and cross-hatched (brown) for LO. 

so-called K factor (NLO to LO ratio). 

We do not include PDF uncertainties in our analysis. For W + 1, 2-jet production at the 
Tevatron these uncertainties have been estimated in ref. [35]. For these processes, they are 
smaller than uncertainties associated with NLO scale dependence at low jet Et, but larger 
at high Et- 

For reference, we also show the LO distributions and corresponding scale-dependence 
bands. The NLO predictions match the data very well, and uniformly (without any difference 
in slope) in all but the highest Et experimental bin. The central values of the LO predictions, 
in contrast, have different shapes from the data. In the upper panels, we have used 5 GeV 
bins to plot the predictions, and have superposed the data points, although CDF used 
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different bins in their analysis. In the lower panel, which shows the ratio of the LO prediction, 
and of the data, to the NLO prediction, we have used the exp erimental bins, which are wider 
at higher Et- A very similar plot was given previously 341 ]. based on a particular leading- 
color approximation. As we discuss in section IVHH those results differ only slightly from 
the complete NLO results presented here. 
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FIG. 6: Theoretical predictions for the Ht distribution in W + 3-jet production at the Tevatron. 
The LO prediction is shown by a dashed (blue) line; the NLO one by a solid (black) one. The 
upper panel shows the distribution itself, while the lower panel shows the distributions and scale- 
dependence bands, cross-hatched (brown) for LO and solid (gray) for NLO, normalized to the NLO 
prediction for /j = Ejf. The numerical integration errors, indicated by thin vertical lines in the 
upper panel, are noticeable only in the tail. 



In fig. [61 we show the distribution for the total transverse energy Ht, given in eq. (12.141) . 
This quantity has been used in top-quark studies, and will play an important role in searches 
for decays of heavy new particles at the LHC The upper panel shows the LO and NLO 
predictions for the distribution, and the lower panel their ratio. The NLO scale-dependence 
band, as estimated using five points, ranges from ±20% around its central value at low Ht 
to ±5% around 400 GeV, and back to around ±10% at 800 GeV. The band is accidentally 
narrow at energies near the middle of graph, because the curves associated with the five 

values converge as the Ht value rises from lower values towards the middle ones. (The 
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fluctuations visible in the tail of the distribution are a reflection of the limited statistics for 
the Monte Carlo integration, as we show a larger dynamical range than in the Et spectrum.) 
The shape of the LO distribution is noticeably different, for any of the \x values, from that 
at NLO. At low Ht, the central LO prediction is 20% below the NLO central value, whereas 
at the largest Ht it is nearly 50% higher. Thus for // = E^f the NLO correction cannot be 
characterized by a constant K factor (ratio of NLO to LO results). We will address some 
of the reasons for the difference in shape in the following section. We note that the NLO 
scale-dependence band has a somewhat different appearance from the corresponding figure 
in ref. 34J , because the latter used wider bins at large Ht and had larger integration errors. 
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FIG. 7: LO and NLO predictions for the di-jet invariant mass distributions M%j (in GeV) in W + 3- 
jet production at the Tevatron. The histograms and bands have the same meaning as in figs. [5] 
and[6l 



In fig. [7J we show the distributions for the three di-jet invariant masses we can form: 
hardest and middle jet M12, hardest and softest jet M13, and middle and softest jet M23. 
The NLO scale-dependence bands are somewhat broader than for the Et or Ht distributions. 
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The distributions become increasingly steep as we move from masses of hardest to softer 
jets. That gross feature is unaltered in passing from LO to NLO, although each distribution 
falls off somewhat faster at NLO, as was the case for H T . 



IV. CHOOSING SCALES 

The renormalization and factorization scales are not physical scales. As such, physical 
quantities should be independent of them. They arise in theoretical calculations as artifacts 
of defining as and the parton distributions, respectively. We will follow the usual practice 
and choose the two to be equal, hr = \ip = fi. The sensitivity of a perturbative result to the 
common scale is due to the truncation of the perturbative expansion; this dependence would 
be canceled by terms at higher orders. NLO calculations greatly reduce this dependence 
compared to LO results, but of course do not eliminate it completely. In practice, we 
must therefore choose this scale. Intuitively, we would expect a good choice for \i to be 
near a "characteristic" momentum scale p for the observable we are computing, in order 
to minimize logarithms in higher-order terms of the form ln(/x/p). The problem is that 
complicated processes such as W + 2, 3-jet production have many intrinsic scales, and it is 
not clear we can distill them into a single number. For any given point in the fully-differential 
cross section, there is a range of scales one could plausibly choose. One could choose a fixed 
scale /i, the same for all events. However, because there can be a large dynamic range in 
momentum scales (particularly at the LHC, where jet transverse energies well above Mw 
are not uncommon), it is natural to pick the scale \x dynamically, on an event-by-event basis, 
as a function of the observable or unobservable parameters of an event. 

A particularly good choice of scale might minimize changes in shape of distributions from 
LO to NLO, such as those visible in figs. [6] and Such a choice might in turn make it 
possible for LO programs incorporating parton showering and hadronization BSQ to be 
more easily reweighted to reflect NLO results. 

Before turning to dynamical scales and kinematic distributions, let us first examine how 
the total cross section depends on a fixed scale. In fig. [8] we display this dependence for the 
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FIG. 8: The scale dependence of the cross section for W + 3-jet production at the Tevatron is 
shown on the left, as a function of the common renormalization and factorization scale (J,, with 
Ho = My/- Similar curves for W~ + 3-jet production at the LHC are shown on the right, except 
that jUo = 2Mn/. In each case, the lower panel shows the K factor. 

Tevatron 6 and the LHC (left and right respectively). We vary the scale between \i = 
and 4M\y for the Tevatron, and between Mjy/2 and 16M^ for the LHC. We must be careful 
to vary the scale in a 'sensible' range. For the NLO calculation in particular, we do not wish 
to reintroduce large logarithms of scales. The figure shows the characteristic increasing-and- 
decreasing of the NLO prediction (see e.g. refs. 85|, |86() as well as the monotonicity of the 
LO one. It also shows a substantial reduction in scale dependence going from LO to NLO. 
The lower panels show the K factor. The large sensitivity of the LO cross section to the 
choice of scale implies a similar large dependence in this ratio. 

We thus see that, as expected, the total cross sections at NLO are much less sensitive 
to variations of the scale than at LO. We now turn to the scale dependence of kinematic 
distributions. In this case the K factor will not only be sensitive to the scale chosen, but it 
will in general depend on the kinematic variable. We will see that a poor choice of scale can 
lead to problems not only at LO, but also at NLO, especially in the tails of distributions. 

The sensitivity to a poor scale choice is already noticeable at the Tevatron, in the shape 
differences between LO and NLO predictions visible in figs. [6] and El However, it becomes 
more pronounced at the LHC because of the larger dynamical range of available jet transverse 
energies. We can diagnose particularly pathological choices of scale using the positivity of 



Note that the Tevatron plot is for £^ rd-jct > 20 GeV, not the cut E^ a ' iet > 25 GeV used in table HJ 
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FIG. 9: The Et distribution of the second jet at LO and NLO, for two dynamical scale choices, 
/i = Ejf (left plot) and [i = Ht (right plot). The histograms and bands have the same meaning 
as in previous figures. The NLO distribution for \i = E^ turns negative beyond Et = 475 GeV. 



the NLO cross section: too low a scale at NLO will make the total cross section unphysically 
negative. 

This diagnostic can be applied bin by bin in distributions. For example, in fig. [9] we show 
the Et distribution of the second-most energetic jet of the three, at the LHC. In the left plot 
we choose the scale to be the W transverse energy (denned in eq. (13.31) ) used earlier in 
the Tevatron analysis. Near an Et of 475 GeV, the NLO prediction for the differential cross 
section turns negative! This is a sign of a poor scale choice, which has re-introduced large 
enough logarithms of scale ratios to overwhelm the LO terms at that jet Et- Its inadequacy 
is also indicated by the large ratio of the LO to NLO distributions at lower Et, and in the 
rapid growth of the NLO scale-dependence band with Et- In contrast, the right panel of 
fig. M shows that Ht (defined in eq. 1 12. lfjp ) provides a sensible choice of scale: the NLO 
cross section stays positive, and the ratio of the LO and NLO distributions, though not 
completely flat, is much more stable. 

Why is n — Ejf such a poor choice of scale for the second jet Et distribution, compared 
with \l = H^ (For an independent, but related discussion of this question, see ref. 40].) 
Consider the two distinct types of W + 3 jet configurations shown in fig. [lOj If configuration 
(a) dominated, then as the jet Et increased, Ejf would increase along with it, by conser- 
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FIG. 10: Two distinct W + 3 jet configurations with rather different scales for the W transverse 
energy. In configuration (a) an energetic W balances the energy of the jets, while in (b) the W is 
relatively soft. Configuration (b) generally dominates over (a) when the jet transverse energies get 
large. 

vation of transverse momentum. However, in configuration (b), the bulk of the transverse 
momentum can be balanced between the first and second jet, with the W and the third 
jet remaining soft. In the tail of the second-jet Et distribution, configuration (b) is highly 
favored kinematically, because it implies a much smaller partonic center-of-mass energy. 
Because Ejf remains small, the wrong scale is being chosen in the tail. Evidence for the 



dominance of configuration (b) over (a) in W + 2-jet production can be found in ref. [401 ]. 
which shows that the two jets become almost back to back as the jet Et cut rises past My/. 
The negative NLO cross section in the left panel of fig. [9] provides evidence of the same 
domination in W + 3-jet production. 

However, configuration (b) also tends to dominate in the tails of generic multi-jet distri- 
butions, such as Ht or My, in which large jet transverse energies are favored. The reason is 
that for jet transverse energies well above Myy, the W behaves like a massless vector boson, 
and so there is a kinematic enhancement when it is soft, as in configuration (b). Exceptions 
would be in distributions such as the transverse energy of the W itself, or of its decay lepton, 
which kinematically favor configuration (a) in their tails. 

In contrast to /j = Ejf , the scale /x = Ht becomes large in the tails of generic multi-jet 
transverse-energy distributions. For the distribution of the second jet Et, this is evident 
from the close agreement between LO and NLO values, shown in the right panel of fig. 
The same features are evident, though less pronounced, in the Ht distributions shown in 
fig. [TTJ The left plot is again for jj, = E^ , and the right plot for \i = Ht- The shapes of the 
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FIG. 11: The Ht distribution for W~ + 3-jet production at the LHC. The scale choices \x = Ej! 
and fj, = Ht are shown, respectively, on the left and the right. The histograms and bands have the 
same meaning as in previous figures. 

LO and NLO distributions for fi = are quite different; the ratio displayed varies from 
around 1 at Ht of 200 GeV to around 2 at Ht near 1200 GeV. In contrast, the ratio for 
fi = Ht is nearly flat. 

These features are not special to the Ht distribution itself. For example, fig. [T21 displays 
the ratio of LO to NLO predictions for two other W + 3-jet distributions for the two scale 
choices. The left panel shows the ratios for the leading di-jet mass, while the right panel 
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FIG. 12: Ratios of LO to NLO predictions for the distributions in the di-jet invariant mass (left 
panel) and AR separation (right panel) for the leading two jets in W~ + 3-jet production at the 
LHC. In each panel, the dashed (red) line gives the scale choice = , while the solid (black) 
line gives the (much flatter) ratio for \i = Ht- 
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shows ratios for the leading AR distribution. Once again the ratios for \x = Ht have a much 
milder dependence than those for /i = Ejf. 

As we shall see further in the next section, the roughly flat ratio for the choice fi = Ht 
holds for a wide variety of distributions. It does not hold for all: some NLO corrections 
cannot be absorbed into a simple redefinition of the renormalization scale. The distribution 
of the second-most energetic jet in fig. provides one example. A second example, discussed 
below, is the H T distribution for W + + 2-jet production in the left plot of fig. [J5J A 
third example (not shown) would be the Ht distribution for W + 1-jet production; this 
case is easy to understand because only configuration (a) (with the second and third jets 
erased) is available at LO, while configuration (b) can dominate at NLO, so effectively a 
new subprocess opens up at NLO. 

Although the scale choice is a poor one as far as the tails of many distributions are 
concerned, we note that it does give reasonable results for the Tevatron and LHC total cross 
sections with our standard jet cuts, which are dominated by modest jet transverse energies. 
For fi = E^, the NLO cross section for W~+ 3-jet production at the LHC is 31.37(0.20)t^ 7 
pb, which has much smaller scale variation than the LO result 37.16(0.07)tio!35 

pb. (The 

parentheses indicate the integration uncertainties, and subscripts and superscripts the scale 
variation.) For /i = Ht, the NLO value is 27.52(0. 14)l2!si pbj the two NLO results are 
consistent within the scale variation band. 
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FIG. 13: The scale dependence of the cross section for W~ + 3-jet production at the LHC as a 
function of n/no, with fiQ = Ht- 
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Accordingly, to have a proper description of distributions, we adopt H T as our default 
choice of scale for W + 3-jet production at the LHC. In fig. [13] we display the scale variation 
of the total cross section, evaluating it at the five scales /i /2, fx /\/2, /i , V2fx , 2fi with 
fio = Ht- As usual, the variation is much smaller at NLO than at LO. Because Ht includes 
a scalar sum, it is somewhat larger than an "average" momentum transfer. One could choose 
a scale lower by a fixed ratio, say Ht/2. This would shift the LO-to-NLO ratio curves in 
figs. [TT] and [121 f° r example, up towards a ratio of 1. It would have only a modest effect 
on the NLO predictions, however, because the scale-dependence curve for the NLO cross 
section is relatively flat. 
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FIG. 14: The first, second, and third jet Et distributions for W + + 3-jet production. The scale 
choice /x = /ihad at LO, shown by the dot-dashed (magenta) lines, is compared with the LO and 
NLO results using our default scale choice \i = Ht, shown respectively in the dashed (blue) and 
solid (black) lines. The scale-dependence bands are shaded (gray) for NLO and cross-hatched 
(brown) for LO. 

It is interesting to compare our default choice fi = Ht with the choice of scale advocated 
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FIG. 15: The distributions for W + + 2,3-jet production. LO results for fj, = /thad are compared 
with LO and NLO results for /i = Ht- The lines and bands have the same meaning as in fig. [HI 



in ref. 40] on the basis of soft-collinear-effective theory, 



^ = \ML d + M^. (4.1) 



In this equation Mh a d is the invariant mass of the jets. (As explained in ref. |40|], the factor 
of 1/4 is a choice, not dictated by a principle.) With this choice, one can greatly reduce 
the shift between LO and NLO in W + 2-jet distributions, compared to more conventional 
choices such as /i = Ejf. We have confirmed that for W + 2-jet production with a few 
exceptions, such as the decay lepton transverse energy, the choice \i = /ihad does fare better 
than fi = Ejf in bringing LO in line with NLO. How does this choice fare in W + 3-jet 
production? To answer this question, we have compared several distributions. In fig. [HJ we 
consider the E T distributions of the first, second and third jets in W + + 3-jet production 
at the LHC We compare the LO results for \x = Ht and /i = /ihad to the reference NLO 
results for /j = Ht- (Any sensible choice of scale at NLO should give very similar results.) 
As can be seen from the figure, the choice /i = Ht leads to a somewhat flatter LO to NLO 
ratio than does \i = /ihad for the first jet, and performs about as well for the second and 
third leading jets. 

It is also instructive to compare the H T distributions for W + 2-jet and W + 3-jet 
production. The left panel of fig. [T5l shows the distribution in W + + 2-jet production; here 
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the scale /i = //had gives an LO result closer to the NLO one. On the other hand, in the right 
panel, which shows the distribution in W + + 3-jet production, the choice \i = Ht gives a 
LO to NLO ratio which is comparably flat to the = /i^ad choice. 
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FIG. 16: The distribution in the positron transverse momentum for W + + 2, 3-jet production at 
the LHC. LO results for \i = /Uhad are compared with LO and NLO results for = Ht- The lines 
and bands have the same meaning as in fig. [Lil 



In contrast, examine the positron pt (or Et) distribution, shown in fig. [16] for W + + 2, 3- 
jet production at the LHC. As can be seen in the lower panel of each plot, the choice /i = Ht 
performs better than \x = //had at LO, in matching the more accurate NLO result at large 
values of E T . The reason is that large E T forces the W transverse energy to be large, which 
in turn favors configuration (a) in fig. [101 in which a relatively low-mass cluster of jets recoils 
against the W boson. Thus the scale \i = /ihad drops below the typical momentum transfer 
in the process. 

In summary, both fj, = /ih a d and /x = Ht are a great improvement over the scale choice 
\i = Erp . For some distributions ji = /ihad is a somewhat better choice at LO than \i = Ht, 
while for other distributions \x = Ht is better. These attributes should not come as a 
surprise, given the multi-scale nature of jet production. 
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V. PREDICTIONS FOR THE LHC 



In this section we present the first complete NLO predictions for W + 3-jet production 
at the LHC. The initial run of the LHC will almost certainly not be at its full design energy 
of 14 TeV, but we choose this energy to simplify comparisons to earlier studies. Most of the 
features visible at 14 TeV would of course remain at the lower energy, such as 10 TeV, of an 



initial run. The production of W + 3 jets at the LHC was also studied at NLO in ref. 41], 
however with a set of subprocesses accounting for only 70% of the cross section; for on-shell 
W bosons; and with a less accurate leading-color approximation than that of ref. [3J]. For 
our analysis of W + 3-jet production at the LHC, we use the following kinematical cuts, 

\rf ct \<3, i? = 0.4, |r/ e |<2.5, £^>20GeV, 

E V T > 30 GeV , > 20 GeV . (5.1) 

We also quote total cross sections with both of the following jet cuts 

> 30 GeV and > 40 GeV . (5.2) 

We show distributions only using the first of these two cuts. We employ the SISCone jet 



algorithm 



everywhere (with / parameter set to 0.5), except for tables [VTl and IVTT1 where 



we use the &t algorithm 87] 



For the LHC we adopt the default factorization and renormalization scale choices, 

fx = H T , (5.3) 

where Ht is defined in eq. (12.101) . As discussed in the previous section, this choice does not 
have the shortcomings of fi = Ejt in describing the large transverse energy tails of generic 
distributions. 

At the LHC, a pp collider, the total rates and the shapes of some distributions are quite 
different for W~ and W + production. At 14 TeV, the qg initial state accounts for over half 
of W+ n-jet production. There are considerably more u quarks than d quarks in the proton 
in the relevant range of the momentum fraction x, leading to greater production of W + than 
W~ . Accordingly, we quote separate results for total cross sections in tables ITVllVHi In 
table M\ we show the W~ + 1, 2, 3-jet cross sections using the SISCone algorithm, for two 
different choices of jet E T cut, 30 and 40 GeV. The corresponding results for + 1, 2, 3-jet 
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Number of jets 


LO 

Elf > 30 GeV 


NLO 

Erf > 30 GeV 


LO 

Elf > 40 GeV 


NLO 

Erf > 40 GeV 


1 


343.29(0.18)1^1 


456.60(1.43)™ 


215.68(0.12)1} 2 ;^ 


298.44(0.77)l^ 2 4 p 


2 


99.78(0.09)™ 


122.71(0.92)1^ 


58.52(0.063)l^ 2 4 f 


72.96(0.54)l 4 i ; 2 ° 


3 


22.28(0.04)1^;^ 


27.52(0.14)1^ 


11.012(0.02)1|^ 


13.96(0.07)1};^ 



TABLE IV: Cross sections for W production using the SISCone jet algorithm, with jet cuts 
Erf > 30 GeV or Elf > 40 GeV. The remaining cuts are as in eq. (|5.1|) . 



Number of jets 


LO 

Elf > 30 GeV 


NLO 

El^ > 30 GeV 


LO 

ES? > 40 GeV 


NLO 

Elf > 40 GeV 


1 


469.37(0.32)1 2 };^ 


615.77(2.04)1 2 |;^ 


301.20(0.22)!}™ 


415.50(1.90)!}^° 


2 


143.91 (O.lS)! 2 .^ 2 


174.28(0.48)1? 5 ^ 7 


86.32(0.12)1}^;!? 


105.99(0.31)1^1 


3 


34.75(0.05)1^ 2 3 ° 6 


41.47(0.27)l 2 |i 


17.64(0.02)1^ 


21.76(0.15)1};! 



TABLE V: Cross sections for W + production using SISCone, with jet cut Erf > 30 GeV or 
Elf > 40 GeV. The remaining cuts are as in eq. (|5.ip . 



Number of jets 


LO 

Elf > 30 GeV 


NLO 

Elf > 30 GeV 


LO 

Erf > 40 GeV 


NLO 

Erf > 40 GeV 


1 


343.29(0.18)1}^! 


444.75(1.44)1^ 2 


215.68(0.12)!} 2 ;^ 


290.44(0.77)l 7 1 5 f 


2 


102.88(0.09)™ 


120.07(0.86)!^ 


59.99(0.06)™ 


70.85(0.42)1 2 ;^ 


3 


25.84(0.05)!^ 


29.29(0.16)!°;! 


12.78(0.02)!^ 


14.89(0.08)1°;^ 



TABLE VI: Cross sections for W production using the kx algorithm (R = 0.4), with jet cut 
Erf? > 30 GeV or Erf > 40 GeV. The remaining cuts are as in eq. (|5,ip . 



Number of jets 


LO 

Erf? > 30 GeV 


NLO 

Elf > 30 GeV 


LO 

Elf > 40 GeV 


NLO 

E^ > 40 GeV 


1 


469.37(0.32)!^};^ 


600.66(2.06)1 2 2|2 


301.20(0.22)!}^ 


405.27(1.91)!}I;i 


2 


148.46(0.19)11;^ 


171.45(0.50)!^ 


88.48(0.12)!}|^ 


103.77(0.31)!|^ 


3 


40.27(0.05)!^° 


44.55(0.28)!^ 


20.45(0.03)!^ 


23.20(0.16)!?;^ 



TABLE VII: Cross sections for W + production using the Ut algorithm, with jet cut Elf > 30 GeV 
or Elf > 40 GeV. The remaining cuts are as in eq. (15. ip . 
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production are given in table |V] In tables IVII and IVHl we show the corresponding results 
for the fa jet algorithm with a pseudo-cone radius of 0.4, for W~ and W + production 
respectively. It is interesting to note that while the NLO cross sections for W + 1,2-jet 
production are larger for the SIS Cone algorithm than for kx (with the algorithm parameters 
we have chosen), the relative size is reversed for W + 3-jet production. (The entries for the 
LO W + 1-jet cross section are identical for the SIS Cone and kx algorithms because the 
same set of events was used to compute them.) 

We next describe NLO results for kinematic distributions. For distributions that do not 
differ appreciably for W~ and W + production, except for overall normalization, we generally 
show a single distribution. 




50 100 150 200 250 300 50 100 150 200 250 



Third Jet E T [ GeV ] Third Jet E T [ GeV ] 

FIG. 17: The E T distributions of the third jet, da(W -> eu+ > 3-jets)/dEy d " jet , at the LHC. 
The left panel shows the case of W~ and the right of W + . 

For the inclusive production of W + 3 jets, a basic quantity to examine is the Et dis- 
tribution for the thirdmost leading jet in Et- This distribution is shown in fig. [T71 As in 
the Tevatron results, the scale uncertainty is considerably reduced at NLO compared to LO. 
With our default choice of scale /x = Ht, the ratio of LO to NLO predictions displayed in 
the lower panels is rather flat over the entire displayed region. (The upward spike in the 
NLO band in the W~ plot at 300 GeV is due to a statistical fluctuation in the evaluation at 
\x = Ht/2.) This plot may be compared to the Et distribution of the second- most energetic 
jet shown in the right panel of fig. M, which undergoes significant shape change between LO 
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and NLO predictions, though less than for the scale choice \x = Ejf- The dynamic range we 
show here is larger than in the corresponding plot for the Tevatron. 
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FIG. 18: The ratio of the Et distribution of the third jet accompanying a W + to the same 
distribution for W~ , evaluated at NLO using /j, = Ht- The two distributions are shown separately 
in fig. [TTJ. The thin vertical lines denote the numerical integration errors. 

In order to examine shape differences between the Et distributions in W + and W~ 
production, in fig. [TBI we show the ratio of the two distributions plotted in fig. [TO The ratio 
is greater than unity at low Et due to the larger total cross section for W + production 
compared to W~ , as given in tables HVl and IVl The ratio increases significantly with Et, on 
the order of 25 percent over the range of the plot, because larger Et forces larger partonic 
center-of-mass energies, and hence larger values of x where the u quark distribution is more 
dominant. 

The Ht distribution also has slightly different shapes for W~ and W + production. The 
right panel of fig. [TT1 shows the Ht distribution in W~ production (with \i = Ht)- The 
corresponding plot for W + is given in the right panel of fig. [15l Across the displayed range, 
the ratio of the NLO W + to W~ distributions (not shown) increases slightly. The increase 
occurs for the same reason as the third jet Et distribution. 

Fig. [T9l shows the differential distributions with respect to di-jet separations ARij. The 
two hardest jets, labeled 1 and 2, are more likely to be produced in a back-to-back fashion, 
leading to a more peaked distribution around n. As in other distributions, the NLO scale- 
dependence band is much smaller than the LO one. The LO and NLO distributions for the 
separation of the leading two jets are somewhat different from each other in shape. This is 
presumably due to the effect of additional radiation allowing kinematic configurations where 
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FIG. 19: The AR distributions for W~ + 3-jet production, from left to right: between the first 
and second, first and third, and second and third jets, using the scale choice = Ht- The AR\2 
distribution between the first and second jets shows a significant shape change in going from LO to 
NLO, while the other two cases are flat. The distributions for W + + 3-jet production are similar. 

the jets are closer together, thereby pushing the weight of the distribution to smaller AR 
values, although the position of the peak is essentially unchanged. The shapes of the other 
two distributions are similar at LO and NLO. All three distributions show sizable shifts in 
their overall normalization, for \x = Ht- 

Fig. [20] displays distributions for the di-jet masses in W~ + 3-jet production. The three 
plots in the figure give the di-jet mass of the first and second, first and third, and second and 
third leading jets, denoted by where i and j label the jets. Although our default choice 
of scale \x = Ht does significantly reduce the shape changes between LO and NLO compared 
with the choice \x = made for the Tevatron (see fig. [7]), significant shape changes remain 
for the M 12 distribution. For the other two cases the ratio between LO and NLO is rather 
flat. These features have parallels in the ARij distributions in fig. [TU the physics of the two 
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FIG. 20: The di-jet masses in W + 3-jet production at the LHC; My (in GeV) is the invariant 
mass of the i-th and j-th leading jets, ordered in Et- 



leading jets is not modeled especially well at LO. 



VI. LEPTONS AT THE LHC 

We now turn from hadronic observables to leptonic ones. At the LHC, the latter distri- 
butions depend strongly on whether a W + or a W~ boson has been produced. 

Fig. [21] shows the pseudorapidity distributions of the daughter charged leptons. Because 
of the large-x excess of u quarks over d quarks, the qg initial state produces W + preferentially, 
and tends to produce them more forward; this fact accounts for the larger and more forward 
positron distribution. The lower panels show that in this case, the NLO corrections modify 
primarily the overall normalization of these distributions, with only a slight change in shape 
from LO to NLO. 

In the right panel of fig. [J6]we showed the positron transverse momentum distribution in 
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FIG. 21: The charged-lepton pseudorapidity distribution at the LHC for W and W + production. 
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FIG. 22: The left panel shows the ratio of the charged-lepton Et distributions at the LHC for W + 
and W~ production, evaluated at NLO, while the right panel shows the corresponding ratio for 
the neutrino Et, or equivalently the missing transverse energy. 

iy + + 3-jet production at the LHC. In order to contrast the distribution with the correspond- 
ing distribution in W~ + 3-jet production, fig. [22] shows the ratio of the NLO transverse 
energy distributions for the W ± boson decay products, charged leptons in the left panel and 
neutrinos in the right panel. The plots show dramatic differences between the W + and W~ 
distributions, especially for the neutrino Et, or missing transverse energy. The left panel 
shows a large ratio for W + to W~ at small E T which declines at larger E T . In contrast, the 
corresponding ratio for the neutrino Et, or equivalently the missing transverse energy $ T 
in the event, starts with a somewhat smaller value but increases rapidly with Et- This sig- 
nificant increase means that the W + + n-jet background to missing-energy-plus-jets signals, 
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when a charged lepton is lost, is more severe than the W~ + n-jet background by a factor 
of two to three. 

This disparate behavior is presumably due to a net left-handed polarization for high 
W ± bosons, which is then analyzed by their leptonic decay via the parity-violating charged- 
current interaction. Before discussing this situation further, it is useful to recall the dynamics 
underlying the longitudinal (rapidity) charge asymmetry in W production at the Tevatron, 



and the corresponding asymmetry for the charged lepton into which the W boson decays [88]. 
At a pp collider, the dominance of u quarks over d quarks implies that in the process 
ud — > W + — > e + v e the W + typically moves in the u quark (proton) direction. Because the 
charged current is left-handed, the u quark must be left-handed, and the d anti-quark right- 
handed. In order to conserve angular momentum, the W + must be polarized left-handed 
along its direction of motion, which in this case (at low transverse energy) is preferentially 
along the beam axis. In the decay W + — > e + z/ e , angular-momentum conservation implies 
that the left-handed W + tends to emit the left-handed neutrino forward, and the right- 
handed positron backward, relative to its direction of motion. The same arguments show 
that the W~ typically moves in the anti-proton direction, is polarized right-handed, and 
tends to decay with the left-handed electron backward relative to its direction of motion. 
Both signs of charged leptons are typically more central than are their parent W bosons. 
In other words, there is a large asymmetry in the rapidity distribution of W + bosons at 
the Tevatron, and a strongly diluted asymmetry in the rapidity distribution of the charged 



decay lepton [88l. 189]. 

Now consider a pp collider (the LHC) and bosons moving with large momentum 
primarily transverse to the beam required to produce the large E T tails for the 

decay lepton distributions shown in fig. [22j Suppose that both W + and W~ bosons are 
polarized left-handed, with a polarization that increases with E^ . Then the W + will tend 
to emit the left-handed neutrino forward relative to its direction of motion (resulting in 
a larger transverse energy) and the right-handed positron backward (smaller transverse 
energy). In contrast, the W~ will emit the left-handed electron forward. Such decays will 
produce an enhancement in the neutrino E? distribution and a depletion in the charged 
lepton distribution, for W + relative to W~, consistent with the ratios displayed in fig. [221 
We have checked that the same distributions as shown in fig. [221 but for W + 1,2-jet 
production, are very similar. Also, the LO ratios in all cases are virtually indistinguishable 
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from the NLO ones. We examined the LO ratios, removing all lepton acceptance cuts, and 
the same general trends persist (in fact they are even stronger at moderate lepton Et)- The 
left-handed polarization of both W + and W~ is also indicated by the corresponding ratios 
of W transverse momenta (not shown). These ratios grow monotonically with the TV's px, 
but at a lower rate than the ratio for the neutrino Ej<. This growth reflects the fact that the 
larger the W transverse momentum we require, the larger is the required parton momentum 
fraction x, and thus the more W + is favored over W~ by the stiffer u quark distribution. 

We do not have a complete understanding of why the W bosons should be polarized left- 
handed at large transverse momentum, in a manner that is apparently fairly independent 
of the number of recoiling jets. For W + 1-jet production at LO, it is possible to examine 
the relevant helicity amplitudes and make such an argument, based on kinematics and on 
the dominance of the qg initial state. It is also the case that when a very high transverse- 
momentum (left-handed) quark splits collinearly to another quark and a W boson, with the 
W boson taking most of the momentum, the W boson is predominantly left-handed. How- 
ever, these examples certainly do not exhaust all of the possible polarization mechanisms, 
and a more thorough explanation would require further study. 



VII. JET-EMISSION PROBABILITY AT INCREASED PSEUDORAPIDITY 
SEPARATION 

One of the production mechanisms for the Higgs boson at the LHC is via vector-boson 



fusion [90], which contains partonic subprocesses such as qQ — > q'Q'H, mediated by the 
fusion of two W bosons. Because the Higgs is produced via colorless electroweak vector 
boson exchange, a relative absence of radiation is expected between the two forward tagging 
quark jets, in comparison with QCD background processes with color exchange. A veto on 
central jets may play an important role in such searches. For this veto to be effective, the 
background processes should retain a substantial probability of additional radiation, as the 
pseudorapidity separation Ar] between two of the jets becomes large. The production of a W 
boson in association with jets is a prime example of a background-type process dominated 
by color exchange at LO. In fig. 61 of ref. js6], a similar question was studied in W + 2-jet 
and W + 3-jet production at the Tevatron, by looking at the probability of finding a third 
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jet in the acceptance as a function of the pseudorapidity separation of the leading two jets, 
ordered by transverse energy. In that figure, CDF data was compared with a leading-order 
QCD prediction. 
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FIG. 23: The ratio of the inclusive W + + 3-jet to W + + 2-jet cross sections at NLO, as a function 
of the pseudorapidity separation A7712 between the leading two jets at the LHC. The solid (black) 
line gives the NLO result, while the dashed (blue) line represents the LO result. The ratio for W~ 
is very similar, particularly at NLO. 



A more appropriate distribution for assessing the effectiveness of a central jet veto would 
be to order the jets in pseudorapidity, not Et, and place an additional constraint that the 
third jet be between the two most widely-separated jets in pseudorapidity, but here we match 
the choice made in ref. [86j. In fig. [23] we plot the ratio of the W + + 3-jet to W + + 2-jet cross 
sections at the LHC, as a function of the pseudorapidity separation of the leading two jets, 
ordered in Et- This ratio measures the emission probability of a third jet. The solid line 



gives the NLO equivalent of the LO Tevatron results in fig. 61 of ref. |86j. As was found at 
the Tevatron, the emission probability is substantial, over 20%, and remarkably independent 
of A7712. Although we plot only the emission probability for jets accompanying a W + , the 
corresponding plot for W~ is essentially indistinguishable from it at NLO. (The difference 
between the LO and NLO results for W~ is smaller than the difference shown in fig. [231 for 
W+.) 
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VIII. SUBLEADING-COLOR TERMS 



In this section, we turn our attention to the question of simplifying a computation by 
taking advantage of the structure of color sums. As explained in section [Til we can organize 
the matrix elements — leading-order, real-emission, or virtual — in an expansion in 1/N C . We 
expect higher-order terms in this expansion to give smaller contributions numerically; but 
there are more of them, and their structure is more intricate than that of lower-order terms. 
In general they take significantly more computer time per event to evaluate. Although one 
could simply drop these contributions once they have been shown — preferably by direct 
computation — to be negligible or reliably estimated, we shall describe how to reduce the 
computer time their direct computation would entail. 



In ref. 



34|, we used a particular type of "leading-color" approximation (LC NLO), in 



which a subset of subleading-color terms were dropped. In the real-emission contributions, 
as well as in the real-subtraction terms, we retained all terms in the color expansion; the same 
was also true for the singular terms in the virtual matrix elements. The only approximation 
was within the finite virtual terms. Here, by "finite" we mean the e° term in the Laurent 
expansion of the infrared- divergent one-loop amplitudes in e = (4 — D)/2, after extracting 
a multiplicative factor of cr(e) (defined in eq. (1A2|) ). 

The approximation was defined by first dropping the subleading-color terms, that is those 
suppressed either by powers of 1/N% (including those coming from leading-color partial 
amplitudes) or Uf/N c (the latter arising from virtual quark loops), in the ratio of the finite 
virtual terms to the tree-level cross section. In a second step, we multiply this truncated ratio 
by the tree cross section, with its full color dependence. The net effect of this approximation 
is to drop quark loops and subleading-color terms in the finite virtual terms that have a 
different kinematic structure than those at tree level, while retaining the subleading-color 
terms that have the same kinematic structure. This approximation turns out to be a much 
better estimate than a strict leading-color approximation (dropping all subleading-color 
terms in the real-emission terms as well), while still simplifying the calculation considerably 
by eliminating the need to compute primitive one-loop amplitudes that contribute only to 
subleading-color terms, such as those shown in fig. [2j 

Table IVIIII compares NLO results for the total cross sections at the Tevatron with the 
experimental setup as in eqs. (EE]) and (El except for the tighter jet cut, > 25 GeV. 
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number of jets 


CDF 


LC NLO 


NLO 


1 


53.5 ±5.6 


58.3(0.1)1^ 


57.83(0.12)1^00 


2 


6.8 ± 1.1 


7.81(0.04)1°^ 


7.62(0.04)1°;| 


3 


0.84 ±0.24 


0.908(0.005)1°;°^ 


0.882(0.005)±g;?;g 



TABLE VIII: Comparison of LC NLO to full NLO for the total inclusive cross sections in pb 



35| (E, 



nth- jet 



> 25 GeV) and the 



of W + n-jet production at the Tevatron using CDF's cuts 
SISCone algorithm. For reference (see also table [II]), the first column gives the CDF data. The 
second column shows the LC NLO results and the third column the complete NLO results. 



cut 


W~ LC NLO 


W- NLO 


W + LC NLO 


W + NLO 


E ] t > 30 GeV 


28.17(0.13)1°^ 


27.52(0.14)1^ 


42.33(0.27)1*;! 


41.47(0.27)11$ 


E^ > 40 GeV 


14.24(0.07)1?;^ 


13.96(0.07)1^ 


22.08(0.15)1^ 


21.76(0.15)11;! 



TABLE IX: Comparison of the total cross sections, in pb, between LC NLO and full NLO results 
for W + 3-jet prediction at the LHC with y/s = 14 TeV, using the SISCone algorithm and the 
cuts of eqs. ((53]) and ([5T2]) . 

O" 



34] 



For reference, we also show the corresponding CDF data. The column labeled "LC N 
contains the results computed using the specific leading-color approximation of ref. 
The last column gives the full NLO result, incorporating all subleading-color terms. Pre- 
viously (34!, we showed explicitly that this approximation is very good for W + 1,2-jet 
production at the Tevatron, leading to errors of no more than three percent. The entry for 
W+ 3 jets is new, and demonstrates that just as for W+ 1-jet and W+ 2-jet production, the 
LC approximation is excellent, shifting the total cross section by just under three percent. 
This shift is much smaller than the NLO scale dependence. In all cases, the LC NLO and 
complete NLO result are both in excellent agreement with the data. 

Table [IX] shows results for the total cross section of W + 3-jet production at an LHC 
energy of 14 TeV, using the cuts given in eqs. (15.11) and (|5.2p . In this case the column labeled 
"LC NLO" refers to an LC approximation that is slightly modified from the one used for 
the Tevatron. We avoid rescaling the leading-color virtual contributions by the ratio of the 
full-color to leading-color cross section; this allows us to simply add together the "leading- 
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color" and remaining "subleading-color" contributions to obtain the total cross section. This 
modification has only a small effect on the corresponding total cross section. Indeed, for 
Tevatron cross sections the shift is under 1.5 percent throu gho ut the scale- variation band. 
(To facilitate comparison to our previously published results 34J, for Tevatron cross sections 
we use the identical LC approximation as in that reference.) 

The modified LC approximation is again accurate to three percent for central values, and 
to five percent for the upper or lower edges of the scale-dependence bands. The good quality 
of the LC approximation also holds for all distributions we have examined. Examples are 
shown in figs. EH and EHJ f° r both the Tevatron and the LHC, the corrections due to the 
subleading-color terms are less than three percent, uniformly across the distributions. 
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FIG. 24: Ratios of our leading-color approximation to a full-color calculation for Et distributions 
in W + 2, 3-jet production at the Tevatron. The numerical integration errors on each bin are 
indicated by thin vertical lines. (Different error sources are combined linearly.) 
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FIG. 25: Ratios of our leading-color approximation to a full-color calculation for the leading-dijet 
invariant mass distribution in W~ + 3-jet production, and for the Ai?i2 distribution in W + + 3-jet 
production, at the LHC. 



The quality of this approximation has important implications for organizing the calcula- 
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tion of virtual contributions. As discussed in section III B\ the primitive amplitudes entering 



the leading-color approximation used in ref. 



34| and discussed above are a small subset of 



the primitive amplitudes required for the complete virtual correction. The full result, includ- 
ing subleading-color terms, requires the computation of a much larger number of primitive 
amplitudes, 336 for two-quark processes and 80 for four-quark processes 7 as opposed to 48 
for two-quark processes and 8 for four-quark processes in the leading-color computation. 
Furthermore, the subleading-color primitive amplitudes shown in fig. [2] have a more compli- 
cated analytic structure than the leading-color ones shown in fig. [TJ because the W does not 
have to be ordered with respect to parton(s) emitted between the W boson and the q or q'. 
The partial lack of color-ordering implies cuts and poles in more channels. As a result, it 
takes about 27 times longer per phase-space point to evaluate the subleading-color virtual 
terms than the leading-color ones. 

This large factor may seem to be a cause for concern. However, the smallness of the 
subleading-color contributions, discussed above, comes to our rescue. If we require that the 
numerical integration errors due to the subleading-color contributions be comparable to those 
coming from the leading-color ones, we can allow for larger relative errors in the evaluation 
of the subleading-color terms. We can adopt a "color-expansion sampling" approach wherein 



we can use far fewer phase-space points (typically a factor of 20 fewer) to evaluate them [49] 
as compared to the leading-color terms. (We must ensure that there are sufficient statistics 
in each bin of every distribution of interest, of course.) There is no need to know ahead of 
time what the relative size of the two contributions is; we simply stop the integration when 
the desired numerical precision is reached for each contribution separately. This approach 
requires only a bit more than a factor of two more computer time for the full-color result 
than for the leading-color approximation, with our present setup. It saves roughly a factor 
of thirty in computer time, compared to the naive approach of evaluating the subleading- 
color terms at every phase-space point. We expect to obtain further improvements in the 
evaluation efficiency of subleading-color contributions through improved re-use of primitive 
amplitudes. Together with color-expansion sampling, this should reduce the time required 
for computing the subleading-color terms to a small fraction of the total computer time. 
This color-expansion sampling approach would naturally be implemented in a dynamical 



7 The count for the four-quark case is for non-identical fermions; for identical fermions, multiply by 2. 
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way by treating the subleading-color contributions as another set of 'subprocesses' within 
the SHERPA multi-channel integration. 

Were we seeking to optimize the computer time more aggressively, we would let the 
total error be dominated by the most time-consuming part of the calculation, namely the 
subleading-color terms. We have, however, opted for a more conservative approach, keeping 
the integration error from the subleading-color contributions in line with the errors from the 
other contributions. 



IX. CONCLUSIONS 

In this paper, we presented the first complete NLO study of W + 3-jet production, 
incorporating all massless partonic contributions. We compared the total cross section and 



the third-jet Et distribution to Tevatron data [35|. We also presented NLO predictions 
for the Ht and di-jet mass distributions. It will be interesting to compare these and other 
distributions to Tevatron data sets with larger statistics than those already published by 



CDF 35]. We presented a variety of distributions at the declared final running energy of 
the LHC, including many relevant for Standard Model backgrounds to events with large 
missing energy and to Higgs boson production via vector-boson fusion. As expected, we 
find a much smaller renormalization- and factorization-scale dependence in all distributions 
at NLO, compared to LO results. Although the LHC will start running at lower energy, our 
choice should help facilitate comparisons to earlier studies based on leading-order QCD and 



matching to parton showers 
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9l|. 



We have shown explicitly in W + 3 jet distributions that the scale dependence of LO 
predictions is not restricted to overall normalizations. An infelicitous choice of scale can 
change the shapes of distributions substantially between LO and NLO. This effect is much 
more pronounced at the LHC than at the Tevatron. One can reduce the change in shape 
of distributions between LO and NLO by choosing a scale dynamically, event b y e vent, 



corresponding to a typical scale for the event, as noted in, for example, refs. [14J, ll6|, l38 |. 
The problem with poor scale choices can be much more severe than just changes in shape 
between LO and NLO results. Indeed, for sufficiently poor choices, such as the fixed scale 



\x = Mw or the transverse energy of the W boson, /i = E? , large logarithms can appear in 
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some distributions, invalidating even an NLO prediction. We find that the total (partonic) 
transverse energy Ht is a more appropriate scale choice for W + 3-jet production than the 
W transverse energy or the fixed scales used in previous Tevatron analyses. (A fixed fraction 
of the total transverse energy would also be appropriate.) We expect that this scale choice 
will be appropriate to a variety of higher-multiplicity processes, and recommend its use in 
LO predictions (when an NLO one is not available) as well as at NLO. A recent paper 
motivates a similar type of scale choice using soft-collinear effective theory, and we have 
contrasted its properties with those of H T . Of course, a simple scale choice is no substitute 
for a complete NLO prediction. In some distributions, such as the transverse energy of the 
second-most energetic jet and the AR separation between the two leading jets, the NLO 
calculation incorporates physics effects that are not captured by simple changes of scale. 

341. which 



We also confirmed that our previous NLO analysis of W + 3-jet production 
used a specific leading-color approximation, is valid to within three percent. This error is 
quite a bit smaller than other uncertainties, such as that implicit in the scale dependence, 
or that due to uncertainties in the parton distribution functions. However, we can draw 
this conclusion only after computing the subleading-color terms, as we have done here. To 
evaluate the subleading-color terms efficiently, we used "color-expansion sampling". The 
subleading-color terms require much more computer time per phase space point. However, 
because they are small, only a few percent of the leading-color ones, we can tolerate a much 
larger relative error for them from the Monte Carlo integration, thus sampling them much 
less often. We expect that this general approach will be an effective technique for reducing 
the computer-time requirements for ever-more complicated processes such as Z + 3-jet, 
W + 4-jet, or Z + 4- jet production. 

In our analyses we mainly used the SIS Cone jet algorithm; we also presented total cross 
sections using the kr jet algorithm 87| at the LHC. These jet algorithms are infrared-safe 
to all orders in perturbation theory. With our setup it is a simple matter to replace one 
infrared-safe cone algorithm with any other desired one. We defer a study of the anti- 
kx algorithm 9^], which has certain experimental advantages such as uniform catchment 
areas for soft radiation, to future work. From a perturbative viewpoint, infrared safety 
is essential; infrared-unsafe quantities are simply logarithmically divergent. In the real 
world, perturbation theory does not go on forever but is overtaken by non-perturbative 
dynamics around the confinement scale. Infrared-unsafe quantities are not infinite, but the 
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infinities are cut off and replaced by quantities determined by non-perturbative physics. 



The logarithms translate 



37 



93| into inverse powers of the strong coupling as, thereby 



spoiling the perturbative expansion. This is an important practical problem because the 
jet algorithms traditionally used at the Tevatron by the CDF and DO collaborations are, in 
fact, infrared-unsafe beyond the lowest orders 37J. Unknown nonperturbative corrections 
for these algorithms would undo many of the benefits of a higher-order prediction, especially 
in the context of new, higher statistics data. Accordingly, it is highly desirable that future 
experimental analyses at both the Tevatron and the LHC use an infrared-safe jet algorithm. 

Our paper demonstrates the utility of on-shell methods for computing one-loop matrix 
elements entering state-of-the-art NLO QCD predictions for processes of phenomenological 
interest at the LHC. We used BlackHat, an efficient new code library based on these 
methods. The NLO W + 3-jet results reported here also demonstrate the functionality 
of our computational setup, which uses BlackHat in conjunction with the SHERPA 
package. Besides handling the real-emission contributions and infrared- singular phase space 
via AMEGIC++, the SHERPA framework offers a convenient set of tools for integrating 
over phase space and analyzing the results. 

There are many relevant processes with large numbers of final-state objects such as jets 
that remain to be computed, especially those involving vector bosons, jets, heavy quarks 
and Higgs bosons [9]. Such processes are backgrounds to the production of new heavy 
particles with multi-body decays. Our setup is robust enough to deal systematically with 
such processes. In the present paper, we have demonstrated the new tools and on-shell 
methods at work for the non-trivial case of W + 3-jet production at hadron colliders. We 
look forward to comparing our predictions against forthcoming LHC data. 



Acknowledgments 

We thank Jeppe Andersen, Christian Bauer, John Campbell, Keith Ellis, Beate Heine- 
mann, Joey Huston, Pavel Nadolsky, Michael Peskin, Gavin Salam, Rainer Wallny and 
Giulia Zanderighi for helpful discussions. We especially thank Costas Papadopoulos and 
Roberto Pittau for assistance in comparing results for the virtual contributions to squared 
matrix elements. This research was supported by the US Department of Energy under 



55 



contracts DE-FG03-91ER40662, DE-AC02-76SF00515 and DE-FC02-94ER40818. DAK's 
research is supported by the European Research Council under Advanced Investigator Grant 
ERC-AdG-228301. Hi's work is supported by a fellowship from the US LHC Theory Initia- 
tive through NSF grant PHY-0705682. This research used resources of Academic Technology 
Services at UCLA, PhenoGrid using the GridPP infrastructure, and the National Energy 
Research Scientific Computing Center, which is supported by the Office of Science of the 
U.S. Department of Energy under Contract No. DE-AC02-05CH11231. 



APPENDIX A: SQUARED MATRIX ELEMENTS AT ONE POINT IN PHASE 
SPACE 

In order to aid future implementations of virtual corrections for W + 3-jet production in 
other numerical codes, we present values of the one-loop virtual corrections to the squared 
matrix elements, d<jy\ at one point in phase space. This comes from the interference 
between the tree and one-loop amplitudes, summed over all colors and helicities, for N c = 3 
and nf = 5 massless quark flavors. 

In table |X] we present numerical values for four representative subprocesses. All other 
subprocesses are related to these four by crossing symmetry. In the second and third lines of 
table 1X1 the presence of two identical quarks (after crossing all particles into the final state) 
means that amplitudes are antisymmetrized under exchange of the two. 

We quote numerical results for the ultraviolet-renormalized virtual corrections in the 



't Hooft-Veltman variant of dimensional regularization [94j. The remaining singularities in 
the dimensional regularization parameter e = (4 — D)/2 arise from the virtual soft and 
collinear singularities in the one-loop amplitudes. 

The quoted values are for the ratio of the virtual corrections to the tree-level squared 
matrix element da^°\ Explicitly, we define the ratio, 

" — TTT~7Fi!" ' (Al) 

v 87ra s c r (e)c/a(°) ' V ; 

where we have also separated out the dependence on the strong coupling as and the overall 
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-—-(1) 
da v 


1/6 2 


1A 


e° 


(lu 2 C -> 3 c 4j5g6 e -7p) 


-8.333333333 


-32.37677210 


1.778061330 


(Is 2 U > 3 n 4j 5 9 6 e - 7p) 


-8.333333333 


-32.40807165 


1.035000256 


2 d ^3 d 4 d -5 g 6 e -7 p ) 


-8.333333333 


-32.50750136 


0.4788030624 


(1« 2 9 3 ff 4 9 5 d -6 e -7 p ) 


-11.66666667 


-42.34303628 


-13.97991225 



TABLE X: Numerical values of the normalized virtual correction to the squared matrix elements, 
d(Ty\ at the phase-space point given in the text, for the four basic partonic subprocesses for W + 3- 
jet production at a hadron collider. We give the finite parts along with the coefficients of the poles 
in e. 



factor cr(e), defined by 



cr(e) 



1 r(i + e)r 2 (i- e ) 



(A2) 



(4vr) 2 - e T(l - 2e) 

The coupling constants, mass and width of the W boson are given in section III G[ How- 
ever, the numerical values for the ratio flAll) . given in table |Xl are independent of these 
parameters; coupling constants as well as the W boson Breit-Wigner factor cancel between 
the tree and virtual correction terms. 



We choose the phase-space point given in eqs. (9.3) and (9.4) of ref. |61| . 



h 
h 

k 5 

K 

k 7 



1, — sin 9, — cos 9 sin (f>, — cos 9 cos < 



2 
2 

I' 
I' 

10 

Bl 

12 

h + k 2 - h - h - h - k e 



[1, sin 9, cos 9 sin <p, cos 9 cos <p) , 
1,1,0,0), 
1, cos P, sin p, 0) , 
(1, cos a cos P, cos a sin P, sin a) , 
(1, cos 7 cos P, cos 7 sin P, sin 7) , 



where 



7T 2tt 37 

a = — , 7 = — , cos P = 

3 3 ' H 128 



and the renormalization scale [i is set to \i = 7 GeV. We have flipped the signs of k\ and 
k 2 compared to ref. jfill, to correspond to 2 — > 5 kinematics, instead of — > 7 kinematics. 
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The labeling of the parton and lepton momenta is indicated explicitly in the first column of 
Table IS 
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